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Preface 


This volume contains a current survey to the end of 2011 of seasonal or calendar 
anomalies by Constantine Dzhabarov and me focusing on the U.S. stock market. 
In addition, there are reprints of various key papers of mine on U.S., Japanese and 
other calendar and fundamental anomalies plus the related topics of strict arbitrage 
and risk arbitrage. 

The reprinted papers are arranged chronologically starting with the early 1987 
paper on the turn-of-the-year effect with Ross Clark along with a description of 
this trade. An update of the turn-of-the-year effect to 2011 is in the Dzhabarov— 
Ziemba survey paper. But papers on specific topics are grouped together. Donald 
Hausch and I in 1990 wrote two papers on pure arbitrage where you construct a 
situation such that you cannot lose and likely can gain. Arbitrage and risk arbitrage 
are discussed for the game of Jai Alai in my 2008 paper with Daniel Lane. Condi- 
tions for pure arbitrage are presented as well as approximations for risk arbitrage 
where one cannot guarantee profits but expected returns or expected utility can be 
large. 

This is followed by the 1995 risk arbitrage paper on Nikkei put warrant mar- 
kets with Julian Shaw and Edward O. Thorp. There we describe a AA’ trade in 
which A is bought long and A’ which is similar but differently priced is shorted. 
Edward O. Thorp won the over $1 million risk adjusted trading contest held by 
Barron’s in 1990 based on this trade. Interestingly, those in Canada who could not 
price properly the over priced put warrants in the end made about $500 million 
because of how much the Japanese stock market dropped, namely about 56% in 
its 1990 move down. All this was predicted by my bond-stock earnings yield crash 
model as the brief note from Wilmott in Chapter 6 indicates. 

Going to Japan in 1988-1989 as the first Yamaichi Visiting Professor of Finance 
was a unique experience. It started me on a path on practical investment manage- 
ment at a high level that merged nicely into my nine years as the main consultant 
to Frank Russell’s research department from 1989 to 1998 when they sold the com- 
pany. While in Japan, I studied land and stock prices a lot as that’s where the 
action was. The stocks had much land holdings and both were used as collateral 
to buy more of the other. Japanese land was the most expensive in the world with 
a square meter in central Tokyo evaluated at nearly $300,000 at the peak. Doug 
Stone and I wrote the Journal of Economic Perspectives article describing this. I 
recall the editor, the great Joseph Stiglitz, sending me nine single spaced pages of 
suggestions of which we followed up enough to get the paper published. In the end, 
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when the Japanese stock market crashed in 1990 and land started to fall in 1991, 
there was about $5 trillion lost in land and another $5 trillion lost in stocks. Too 
bad that the Japanese did not invest more wisely. They simply bought what they 
already owned, namely their own land and stocks. Despite lots of overpriced pur- 
chases, see Ziemba and Schwartz (1992) for lists, they only had 3% of their assets 
invested abroad. Japanese stock market anomalies are studied in the 1991 Japan and 
the World Economy paper. My late friend, the twentieth century’s most important 
economist, Paul Samuelson, wrote to me saying that the Japanese stock market was 
held together by chewing gum. He was right as it did collapse. But he also felt it was 
not based on the economics. However, I found that basically all the U.S. anomalies 
were there but on slightly different dates. For example, the turn-of-the-month that 
in the U.S. was on days —l to +4 was on —5 to +2 in Japan. The reason being that 
salaries and sales pushes started then. This paper has a comprehensive analysis of 
Japanese anomalies paralleling the U.S. research. 

Cappoza and Ziemba’s 1993 paper discusses the design and some trading expe- 
rience concerning hedge funds based on anomalies. 

There are three 1994 papers. My late Finnish friend Teppo Martikainen, Jukka 
Perttunen and I studied the turn-of-the-month effect during the period January 
1988 to January 1990 in various markets across the world. The effect was working 
then and its reasons are related to new money coming into the market on the —1 
or +1 days. See also the Russell research report in Chapter 6 and the newspaper 
clippings there concerning the turn-of-the-month effect. Chris Hensel, another late 
colleague from Frank Russell and I studied some anomalies focussing on the turn-of- 
the-month effect over the long period 1928-1993 for the U.S. Chris Hensel, Gordon 
Sick and I in 1993 studied the turn-of-the-month effect the U.S. stock index futures 
markets from 1982-1992. Chapter 6 also has a Frank Russell research report Chris 
Hensel and I wrote in 1995 with European, North American, Pacific and worldwide 
January barometer results. 

The European Journal of Operational Research survey paper I wrote is broader 
and covers fundamental as well as seasonal anomalies. A monthly model we made 
in Japan to rank all the Tokyo first section stocks from 1 to N using thirty fun- 
damental factors is discussed. This follows in spirit the Jacobs and Levy (1988) 
model for the U.S. that started their very successful investment management firm. 
The top seven factors future projected earnings over price which is a forward look- 
ing PE ratio, the ordinary trailing PE ration, small cap and two mean-reversion 
factors. With monthly revisions and yearly re-estimation, the model performed 
well. I consulted ith Buchanan Securities in London on a similar model. They 
bought our book, Ziemba and Schwartz (1991), and discovered that in the mid- 
dle of the book was a better factor model than what they were using. The model 
from Japan was estimated with rising stock prices but, slightly modified, worked 
well in a declining stock market. Buchanan was in the business of going long cheap 
warrants and shorting stocks and the model helped their selection process. The 
European Journal of Operational Research paper discusses seasonally effects such 


Preface xiii 


as the turn-of-the-month, holiday, the January barometer, time-of-day, the Value 
Line enigma and other topics. 

My 2003 monograph on the Stochastic Programming Approach to Asset-Liability 
and Wealth Management published by Association for Investment Management 
and Research (AIMR) studies the 2000-2002 U.S. stock market crash among other 
topics. This uses the bond stock earnings yield model first presented in the 1991 
book Invest Japan. The model was discovered from the 1987 crash and predicted 
many crashes denned as a fall of 10%+ within a period of time about 4-12 months, 
usually from the start. The model called this internet bubble crash. In the 2005 
paper, Finnish colleagues Matti Koivu and Teemu Pennanen and I study the co- 
integration properties of the bond stock earnings yield differential (BSEYD) model 
focusing on the U.S., U.K. and Germany. The 2008 paper with Klaus Berge and 
Giorgio Consigli shows that if you stay in the stock market when the BSEYD model 
is not in the danger zone and out of the market when it is gives about double the 
final wealth that buy and hold provides with lower risk. See Lleo and Zimba (2012) 
to 2006-2009 predictions. 

Anomalies and behavioral biases are highly related and both yield advantages 
for the investor. In horse race markets, the most important behavioral bias is the 
tendency for the best, i.e., lowest odds horses to be under bet and the longshots over 
bet. Indeed, longshots at 100-1 have true value about 700-1. This has been known 
bookmakers for at least 100 years. In the 2008 Handbook on Sports and Lottery 
Markets, which I edited with my long time racing research colleague Donald B. 
Hausch, we study this favorite-longshot and other biases. We have used such ideas 
in various betting strategies discussed in our other books such as Ziemba and Hausch 
(1984, 1986, 1987, and 2008) and Hausch, Lo and Ziemba (1994, 2008). My paper 
on the efficiency of racing, sports and lottery betting markets surveys such biases in 
many markets. Included are favorite-longshot graphs for horse race betting markets 
and how they evolved over time, football, basketball efficiency and discussions of 
our weak form inefficiency in racetrack place and show markets as well as lottery 
applications including inefficiencies with unpopular numbers. 

In my 2008 paper with Robert Tompkins and Stewart Hodges, we show that 
S&P500 futures options display biases similar to the favorite-longshot bias. This 
yields ideas for hedge fund trading strategies that I have used extensively. 

Finally, my 2008 paper with Marshall Gramm discusses a very interesting race- 
track anomaly. This concerns the Kentucky Derby, Belmont Stakes and other races. 
In these two races, the horses have never run the distance they are about to run. 
So maybe their breeding will forecast if they have enough speed and stamina to 
win these tough races. This anomaly, like many others, has changed over time but 
still seems to be there in some form. Rod Bain, Donald Hausch and I, in our 2006 
paper, look deeper into the dosage theory applied to the Kentucky Derby during 
1981-2005, showing the great advantage of this anomaly. 

Thanks go to Constantine Dzhabarov for working with me the last few years on 
stock market anomalies that led the to the main paper of this book. Discussions with 
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the great investors Edward O. Thorp and Blair Hull as well as my many authors 
listed above have been helpful over the years. Finally, my wife, Sandra Schwartz, 
helps me in a myriad of ways as a sounding board, critique and producer of the 
books. 


William T. Ziemba 
Vancouver, January 2012 
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Chapter p 
Introduction — Calendar Anomalies! 


Constantine S. Dzhabarov 


Alpha Lake Financial Analytics Corp, Canada 


William T. Ziemba 


University of British Columbia, Canada 
ICMA Centre, University of Reading, UK 


This chapter is a survey of seasonal anomalies. Ziemba has been involved in the 
research and trading of such anomalies as the January turn-of-the-year effect since 
1982. His research plus that of other academics plus the very useful practitioner 
research of Yale Hirsch’s Stock Trader’s Almanac starting in 1972 is reviewed. (We 
academics reference Hirsch but the Hirsches operate in a closed economy, not ref- 
erencing others.) The discussion begins with an assessment of why the seasonal 
anomalies are so controversial but valuable and discusses some survey papers and 
books. Then beginning with the seminal anomaly, the January small firm effect, 
we discuss various other anomalies and their use in strategies including the con- 
struction of seasonality calendars that rank the various trading days of the year. 


1.1 Introduction to Seasonal Anomaly Effects 


Seasonality of stock markets has a long history despite the academic research being 
dominated by efficient market theory as surveyed by Fama (1970, 1991). Small firm 
effects were popularized by University of Chicago students Banz (1981), Reinganum 
(1981), Blume and Stambaugh (1983), Roll (1983), and Ritter (1988) among others. 


1 Dedicated to the memory of Merton H. Miller, Ziemba’s co-host in 1996 at the Graduate School 
of Business, University of Chicago, and to the memory of Chris Hensel, Ziemba’s co-author of 
many anomaly papers at the Frank Russell Company and University of Chicago MBA. It was 
there in the early 1980’s when Banz, Blume, Keim, Reinganum, Ritter, Roll, Stambaugh and 
other students at the most strongly efficient market oriented US finance department published 
small stock market anomalies papers in top finance journals and opened up the area. Miller, a 
strong efficient market academic but also savvy practical student of the markets used to tell me: 
“The half life of an anomaly is three years.” Ziemba’s experience since 1982, when he first traded 
the turn-of-the-year effect in the futures markets, is that when markets are regular (not too high 
volatility), the anomalies tend to work. But each year or play usually is slightly different and may 
move around. So constant research and careful risk control is important in using these results in 
trading. The true test is can you use them to make excess risk adjusted profits and Ziemba believes 
this to be the case. This monograph also updates some of the results from Dzhabarov and Ziemba 
(2010, 2011) and various anomaly papers Ziemba has published. 


2 Calendar Anomalies and Arbitrage 


Early surveys are in Lakonishok and Smidt(1988), Thaler (1992) and Ziemba 
(1994). The latter references considerable regularity of various seasonal anomalies 
in Japan as well as in the U.S. Jacobs and Levy (1988abc) have used seasonal 
and fundamental factor model derived anomalies to create a multibillion dollar 
investment firm. Dimson (1988) and Keim and Ziemba (2000) present whole books 
with studies across the world. The Stock Traders Almanac discusses some such 
anomalies in yearly updates; see Hirsch and Hirsch (2011). 

Anomalies of the seasonal variety as discussed in this chapter and those based 
on fundamental and other factors in the rest of this book, and in Keim and Ziemba 
(2000) and Zacks (2011) are not fully accepted nor believed by many strong efficient 
market theorists. Part of this dismissal is that the anomalies are too small to be 
bothered with as Ross (2005) argues. So, more or less, does Fama (1970, 1991).The 
great financial empiricist Roll(1994) makes the startling statement that even with 
considerable resources, he has never been able to find a profitable anomaly. The 
well-known book Malkiel (2011) even states that strong effects like the January 
effect do not exist. Marquering, Nisser and Valle (2006) argue that the anomalies 
disappear after they are published, although some reappear; see also Dimson and 
Marsh (1999) and Schwert (2003). Hudson, Keasey and Littler (2002) and Lucey 
and Pardo (2005) discuss how anomalies are affected by papers published on them. 

There also is the serious issue of data mining. Indeed, many results are in-sample 
and true tests out-of-sample. Statistical verification of the actual existence of signif- 
icant seasonal anomaly effects is studied by Sullivan, Timmerman and White (1999) 
who analyze 9452 calendar based trading rules. See also Hansen, Lunde and Nason 
(2005) who study 181 calendar effects and Lo and MacKinley (1990) who discuss 
data snooping biases. Also t values tend not to show statistical significance in many 
cases where successful trades have been made because of high standard deviations. 

Rather than debate such people, Ziemba and Ziemba (2007) simply argued that 
there are five basic stock market camps. Each has a cut or version of certain sections 
of the market and makes its point for a certain subset of market participants, 
instruments and strategies. There may be other classifications but these provide a 
useful framework for discussion. 


The Five Groups are: 

1. Efficient markets (E) 

2. Risk premium (RP) 

3. Genius (G) 

4. Hog wash (H) 

5. Markets are beatable (A) 


The first group are those who believe in efficient markets (E). They believe that 
current prices are fair and correct except possibly for transactions costs. These 
transaction costs, which include commissions, bid-ask spread, and price pressures, 
can be very large.” 


2A BARRA study by Andy Rudd some years ago showed that these costs averaged 4.6% one-way 
for a $50,000 institutional investor sale of small cap stock. This is if you use a naive market order 
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The leader of this school which had dominated academic journals, jobs, fame, 
etc. in the 1960s to the 1980s was Eugene Fama of the Booth School of Business, 
University of Chicago. A brilliant researcher, Fama is also a tape recorder: you 
can turn him on or off, you can fast forward or rewind him or change his volume, 
but you cannot change his views no matter what evidence you provide; he will 
refute it forcibly. In the aggregate there is much to support his case. One such 
example is the $300+ million gift from former student David G. Booth to name 
the Chicago Graduate School of Business earned from fees from his index fund 
firm Dimension Fund Advisors (DFA) founded with another Chicago student Rex 
Sinquefield. Booth never got his Ph.D. but Fama helped him get a job and later in 
1981 DFA was founded with Fama as a key advisor. Some multi-billion plus later 
in fees shows how low fees with simple strategies can add up with huge volume. 
Booth has a co-authored paper with Donald Keim on the January effect in the cash 
market in the Keim and Ziemba (2000) anomalies book. 

This group provided many useful concepts such as the capital asset pricing model 
of Sharpe (1964), Lintner (1965) and Mossin (1966), which provided a theoretical 
justification for index funds, which are the efficient market camp’s favored invest- 
ment mode. They still beat about 75% of active managers. Since all the managers 
comprise the market, that’s 50% of them beaten by the index. Transactions (com- 
mission plus market impact) such as exchange taxes, bid-ask spread and other costs 
eliminate another 25%. See Ziemba and Schwartz (1991:44—46) for examples of how 
few funds beat the index across the world. In a sample of 167 funds, only 48 (28.7%) 
beat the benchmark. 

Over time the hard efficient market line has softened into a Risk Premium (RP) 
camp. They feel that markets are basically efficient but one can realize extra return 
by bearing additional risk. They strongly argue that, if returns are above average, 
the risk must be there somewhere; you simply cannot get higher returns without 
bearing additional risk. For example, beating the market index S&P500 is possible 
but not risk adjusted by the capital asset pricing model (CAPM). They measure 
risk by Beta, which must be greater than one to receive higher than market returns. 
That is, the portfolio risk is higher than the market risk. But they allow other risk 
factors such as small cap and low book to price. But they do not believe in full 
blown 20-30 factor models such as used by Jacobs and Levy (1988) for the U.S. 
and Schwartz and Ziemba (2000) for Japan. Rather they prefer to use just a few 
factors and small cap and price to book value are favorites. Ziemba recalls Barr 
Rosenberg focusing on small cap and low price to book as the key factors in 1967, 
see Rosenberg, Reid and Lanstein (1985). Later, Fama and French (1992) took the 
credit for these ideas with a more complete study. Fama and his many disciples 
moved to this camp in the 1990s. This camp now dominates the top U.S. academic 


for the full transaction rather than limit orders or smaller market orders. Thorp, in a private com- 
munication, told Ziemba that he traded about $60 billion in statistical arbitrage from 1992-2002 in 
lot sizes of 20K to 100K and found that the mean transaction cost was about 1 cent per share and 
the market impact was about 4.5 cents per share for shares averaging about $30. So the one-way 
costs were about 0.18%. 
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journals and the jobs in academic finance departments at the most famous business 
schools in the U.S. and Europe. 

The third camp is called Genius (G). These are superior investors who are bril- 
liant or geniuses but you cannot determine in advance who they are. The late MIT 
economist Paul Samuelson championed this argument. Samuelson felt that these 
superior investors do exist but it is useless to try to find them as in the search for 
them you will find 19 duds for every star. Surprisingly, Samuelson was an early 
investor in the very successful futures trading operation Commodity Corporation 
run, partly, by one of his MIT students. This view is very close to the Merton— 
Samuelson criticism of the Kelly criterion: that is, even with an advantage, it is 
possible to lose a lot of your wealth. See MacLean et al. (2011) for simulations 
discussing this point and Thorp and Ziemba’s (2012) response to private letters 
received from Samuelson. The evidence though is that you can determine some 
superior investors ex ante and to some extent they have persistent superior per- 
formance, see Fung et al. (2006), Jagannathan et al. (2006), Ziemba (2005) and 
Gergaud and Ziemba (2012). Soros did this with futures with superior picking of 
commodities and currencies to bet on; this is the traders are made not born philos- 
ophy. This camp will isolate members of other camps such as in (A) or (H). 

The forth camp is as strict in its views as camps (E) and (RP). This group feels 
that efficient markets that originated in and is perpetuated by the academic world 
is hogwash (H). In fact the leading proponent of this view, and one with whom 
it is hard to argue as he tops the lists of richest people and greatest investors, is 
Warren Buffett, who wants to give university chairs in efficient markets to further 
improve his own very successful trading. An early member of this group, the great 
economist John Maynard Keynes was an academic. We see also that although they 
may never have heard of the Kelly criterion, this camp does seem to use it implicitly 
with large bets on favorable investments. See MacLean, Thorp and Ziemba (2011). 
Ziemba and MacLean (2011) present a table of actual asset positions in a George 
Soros fund where about half the portfolio is invested in one position and equity 
weights near 10% for several positions are in Warren Buffett’s Berkshire Hathaway. 
They do not care about monthly losses, and have many, but they focus instead on 
high long term wealth growth. So they resemble Kelly bettors. 

This group feels that by evaluating companies and buying them when their value 
is greater than their price, you can easily beat the market by taking a long-term 
view. They find these stocks and hold them forever. They find a few such stocks 
that they understand well and get involved in managing them or they simply buy 
them and make them subsidiaries with the previous owners running the business. 
They forget about diversification because they try to buy only winners. They also 
bet on insurance when the odds are greatly in their favor. They well understand 
tail risk which they only take at huge advantages to themselves when the bet is 
small relative to their wealth levels. Indeed Buffett’s long-term approximately 15- 
18 year bet shorting S&P500 over-the-counter puts, for which he received much 
incorrect criticism, sure looks good now with the S&P500 closing 2011 at 1257.60 
and in the 1400 area in April 2012. The odds favor him not to have to pay back 
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the $4.6 billion premium he is now using for trades like the 10% loans to GE and 
Goldman Sachs. Calculations indicate that Buffett collected about twice the fair 
value of these American put options that cannot be exercised until expiry many 
years in the future. 

The last group are those who think that markets are beatable (A) through 
behavioral biases, security market anomalies and other research using computerized 
superior betting techniques. They construct risk arbitrage situations with positive 
expectation. They research the strategy well and follow it for long periods of time 
repeating the advantage many times. They feel that factor models are useful most 
but not all of the time and show that beta is not one of the most important variables 
to predict stock prices. They use very focused, disciplined, well researched strate- 
gies with superior execution and risk control. Many of them use Kelly or fractional 
Kelly strategies. All of them extensively use computers. They focus on not losing, 
and they rarely have blowouts. Members of (A) include Edward O. Thorp (Princeton 
Newport and later funds), Bill Benter (the Hong Kong racing guru), John Henry 
(the Red Sox owner), Blair Hull (the mispricing of options guru in Chicago), Harry 
McPike (a trend follower), Jim Simons (Renaissance hedge fund), Peter Muller (of 
Morgan Stanley) Jeff Yass (Susquehanna Group) and David Swensen (who runs 
the Yale University endowment). Most of these Ziemba has had dealings with or 
consulted for. This group has made many billions trading. Blowouts occur more in 
hedge funds that do not focus on not losing and true diversification and over-bet; 
when a bad scenario hits them, they get wiped out, such as Long-Term Capital Man- 
agement (LTCM), Niederhofer, and Amarath, etc.; see Chapters 11-13 of Ziemba 
and Ziemba (2007). 

So we will proceed assuming we are in category A and look at the data, possible 
explanations and some trading results. Statistical significance is frequently an issue 
even in long sequences of successful trades because of high standard deviations. 


1.2 January Effect 


We refer to the January effect as the tendency of small cap stocks to outperform 
large cap stocks in the month of January. Rozeff and Kinney (1976) showed that 
equally weighted indices of all the stocks on the New York Stock Exchange (NYSE) 
had significantly higher returns in January than in the other eleven months during 
1904-1974. Keim (1983) documented the magnitude of the size effect by month using 
1963-1979 data. He found that half the annual size premium was in January. Blume 
and Stambaugh (1983) showed that, after correcting for an upward bias in mean 
returns for small stocks that was common to earlier size effect studies, the size effect 
was only in January. Figure 1.1 shows the historical evidence from January 1926 to 
December 1995 of the difference in January between the lowest decile and the highest 
decile by market capitalization of the NYSE index plus American Stock Exchange 
(AMEX) and Nasdaq stocks of similar size. Only five years out of 70 did small 
caps underperform in January and in most years, the small cap outperformance is 
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R(10th Declle) - R(1st Declle) (Percent) 
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January Only 


Figure 1.1: January Effect, 1926-1995. January Size Premium = R(10'") — R(1*). 
Source: Booth and Keim (2000) 
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Figure 1.2: Russell 2000 — S&P500 Futures Spread Average Returns During Various Months, 
1993-2011. 


considerable. The Rigin — Rise decile returns averaged 4.48% with a t = 2.83 from 
January 1982 to December 1995. 

To update, we calculated the Russell 2000/S&P500 futures spread by month 
from 1993 to December 2011. All data in this chapter is updated to the end of 
2011. As argued by Rendon and Ziemba (2007), the January TOY effect still exists 
but has moved to December. Indeed, Figure 1.2 shows that the small cap/large cap 
spread is positive in December and negative in January. 
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Figure 1.3: S&P500 Futures Average Monthly Returns. 


The January monthly effect for small and large cap stocks measured by 
the Russell 2000 and S&P500 futures has been negative during January 1993 — 
December 2011 and January 2004 — December 2011. Figures 1.3ab and 1.4ab show 
the results with the data in Tables A.2 and A.3 in the Appendix. The results show 
the historically expected very negative October in the recent S&P500 data and 
in both sets of Russell 2000 data. Surprisingly, the historically strong months of 
November, January and February were negative for both the small and large cap 
data recently. While most of the other seasonality effects have still produced valuable 
reliable anomalies, the monthly effect does not look to be of much use for traders 
and investors. But sell in May and go away, discussed below, does add value. 
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Figure 1.4: Russell 2000 Futures Average Monthly Returns. 


Several subsequent analyses built on Keim’s study and considered the possibility 
that the January effect was diminishing based on the inclusion of later years of data, 
but Easterday, Sen and Stephan (2008) also expanded their study to include years 
before Keim’s analysis, which allowed them to better assess trends in the January 
Effects magnitude. They included the years from 1946-2007, performing a time 
series analysis according to the three sub-periods in relation to Keim’s 1963-1979 
window: before, during, and after. Over this period, they studied NYSE and AMEX 
firms and, from 1971 onwards, they also considered NASDAQ firms, which allowed 
them to consider more small cap stocks in their analysis. 
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Contrary to studies based on the Keim period and later years, Easterday et al. 
do not conclude that the January effect is declining. In other words, they do not find 
evidence that investors are acting on the arbitrage opportunity and internalizing it 
into higher prices. Instead, they find that the January effect continues to be robust 
in small firms and that, in recent years, it has not so much diminished as returned 
to a level similar to the effect exhibited prior to 1963. Easterday et al. also con- 
sidered trading volume in January, which should be higher if investors are actively 
arbitraging the January effect opportunity, but they did not find any evidence of 
higher trading volumes. 

Haug and Hirschey (2006) also extensively analyzed the January effect, using 
both value-weighted and equal-weighted equity returns. Their findings concur with 
Easterday and particularly note the consistency of the January effect in small cap- 
italization stock returns across time. For instance, they find that the difference in 
average mean value-weighted portfolio return is 0.40% from 1802-2004, and that 
this number is even greater, 0.61%, from 1952-1986 (roughly the period Keim stud- 
ied expanded two-fold). 

Haug and Hirschey explore potential explanations of the January effect phe- 
nomenon, ruling out biases that would more markedly affect large capitalization 
stocks, such as the timing considerations of institutional investors during portfolio 
rebalancing around official reporting periods. Statistical arguments brought up by 
Sullivan, Timmerman and White (1999), among others, center around the inher- 
ent statistical problem of testing an empirical aspect of a data set using the same 
data set, which fundamentally calls into question the underlying statistical methods 
used in the analysis. Additionally, two theories concerning relatively small investors 
concern end-of-year tax considerations or income events, such as year-end bonuses, 
which lead to new purchases in the new year. However, both of these potential expla- 
nations come into question when considering international indices under different 
tax regime timing and across changes in tax laws that should have an effect. 

Among other measures, Haug and Hirschey use the Tax Reform Act of 1986 to 
test these and other behavioral hypotheses as potential explanations of the January 
effect, but they reach contradicting conclusions using different data, namely value- 
weighted and equal-weighted returns. They ultimately conclude that each of these 
explanations remain potential but unproven drivers of the still perplexing January 
effect phenomenon. 


1.2.1 Trading the January small cap effect in the futures markets 


The evidence suggests that small stocks outperform large stocks at the turn of the 
year. Yet, transactions costs, particularly bid-ask spreads and price pressures, take 
away most, if not all, of the potential gains. See e.g., Stoll and Whaley (1983) and 
Keim (1989) on these effects in January. However, transaction costs to trade index 
futures are a tenth or less of those for the corresponding basket of securities, and 
even more important, there is much less market impact. Hence, it may be profitable 
to buy long positions in small stock index futures and sell short positions in large 


10 Calendar Anomalies and Arbitrage 


stock index futures. This pair of positions is known as a spread trade. The strategy 
must anticipate the effect in the market place, in particular, the price impact of 
buying and selling futures contracts. Stock index futures began trading in the U.S. 
in May 1982. The Value Line minus S&P500 and Russell 2000 minus S&P500 spreads 
are two ways to measure and possibly capture any advantage small stocks may have 
over large cap stocks. 

Ziemba started doing this trade for the 1982/1983 TOY with Vancouver com- 
modity broker Ross Clark. Clark and Ziemba (1987) describes these markets and 
the Value Line/S&P spread. See also the synopsis of this trade in Chapter 6. At 
that time the Value Line, which had about 1700 stocks was geometrically weighted. 
Hence, by the geometric-arithmetic inequality this produced a downward drift of 
about i% per month. After 1988 the index became price weighted arithmetic. Clark 
and Ziemba used the following trading rule: 


buy the spread on the first closing uptick, starting on December 15 and 
definitely by the 17**, and sell on January 15. Waiting (to enter) until 
(—1) now seems to be too late: possibly finance professors and their col- 
leagues, as well as other students of the turn-of-the-year/January effect 
who are in on the strategy, move the VL index. There seems to be a bid- 
ding up of the March VL future price relative to the spot price. (Clark 
and Ziemba, 1987:805) 


Their idea at that time was that the January small firm effect existed and occurred 
during the first two weeks of January in the cash market (as argued by Ritter, 1988; 
see also comments by Ziemba, 1988), but that futures anticipation would move the 
effect in the futures markets into December. Hence, an entrance into the Value 
Line/S&P500 futures spread trade in mid-December and an exit in mid January 
should capture the effect if it actually existed. With data up to the 1985/1986 TOY, 
their trade rule was successful. They concluded that small cap advantage was mainly 
in the first half of January, with some anticipation in the final days of December, 
and with a large cap advantage in the second half of January. 

Ziemba continued trading this spread for the 14 TOY’s (all winners) from 1982- 
1983 to 1995-1996 and updated the results in Ziemba (1994) and Hensel and Ziemba 
(2000). 

Hensel and Ziemba (2000) analyzed the January effect in the futures markets 
and concluded that for the 1980s and early 1990s there was a small cap advantage 
in the futures and cash markets. However, they show that from 1994 to 1998 there 
was no advantage in the cash market, and that anticipation built up in the last half 
of December in the futures markets. As a consequence, for the four TOYs during 
the 1994-1998 period, the January effect only existed in the last half of December, 
in the futures market. They analyzed small minus large spread trades between the 
Value Line and the S&P500 futures contracts and concluded that the January effect 
was exploitable in the futures markets in this period. 

Rendon and Ziemba (2007) updated Hensel and Ziemba (2000) to analyze the 
seven TOYs from 1998-1999 to 2004-2005 for the Value Line minus S&P500 spread 
trade, and provided additional evidence by analyzing a second spread trade involving 
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the Russell 2000 and the S&P500 futures contracts. From 1998 to 2005, their analysis 
shows that the January effect is still present in the futures markets in the Value 
Line minus S&P500 spread trade, but that it has become increasingly risky to try 
to exploit it because of the marginal liquidity of the Value Line stock index futures 
contract. For the Russell 2000 minus S&P500 spread trade, the January effect has 
been profitable. 

Rendon and Ziemba (2007) investigated the day by day cash and futures returns 
of the spread trades: Value Line minus S&P500 and Russell 2000 trading, as well as 
for the 12 TOYs for which there was Russell 2000 futures trading up to 2004/2005. 

Rendon and Ziemba (2007) computed the cash index and March futures spreads 
(Value Line 500 minus S&P500, Value Line 100 minus S&P500, and Russell 2000 
minus S&P500) day by day in December and January. The 500 and 100 refer to the 
contract size, $500 or $100 per point, respectively. The spread between the futures 
difference and the cash difference represents the futures anticipation. Table 1 in 
Rendon and Ziemba (2007) (not reprinted here) summarizes, for each spread, the 
anticipation or lack of it in the second half of December, during the TOY (trading 
days —1 to +4), the rest of the first half of January and the second half of January 
for all the TOYs. Some years were easy and others hard to make profits. Figure 1.5 
shows eight typical turn-of-years plots for December and January: 


1. A typical trade for 1999/2000 showing the futures and cash spreads (TOY) for 
the Value Line 500 minus S&P($500 value). The dotted line is the futures spread 
and the dark line is the cash spread. In this case you could enter at a discount 
in mid December. The trade gained but you had to cash out in mid January at 
a discount. 

2. The 1997-1998 TOM also for the VL500/S&P500 spread had all the advantage 
in December then the spread declined all January. You had to buy at a premium 
but you could sell for fair value. If you did not sell at the end of December you 
would have lost some or all of your gains. Traders seeing such a trend in January 
would likely have cashed out. 

3. The VL100/S&P500 for 2003-2004. The TOM had virtually no futures spread 
discounts or premiums. Since the spread increased continuously from 
December 17 traders easily made profits. 

4. The TOM for 2000-2001 for the VL100/S&P500 value-adjusted for the cash and 
March futures. The spread with some volatility increased throughout December 
and January so the trade made profits regardless of when it was entered and 
exited. There were premiums or discounts throughout December and January. 

5. In 1994-1995 for the R2000/S&P500 there were large gains in December and the 
spread peaked at the end of December. Those who did not cash out then lost 
some of their gains in January. 

6. In 1993-1994 for the R2000/S&P500 spread you had to buy at a premium. Then 
the spread fell in December and non-true believers cashed out with losses but 
believers won when the spread gained in January and traders likely could have 
traded out at a futures spread discount. 
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7. The TOY for 2004-2005 for the R2000/S&P500 spread won in December and 
you could have bought the spread at a discount. But if you held the spread into 
January you would have given back some or all of your gains or even lost money. 

8. In 2001-2002 you had to buy the R2000/S&P500 spread at a premium but the 
trend was up until mid January. So traders made profits. 


The anticipation year by year patterns vary greatly. For the Value Line 500 
minus S&P500 spread, Hensel and Ziemba (2000) reported an apparent shift from 
an anticipation in the first half of January in the early years, to an anticipation 
during the second half of December in the late 90s, as well as a lack of small cap 
anticipation past the TOY, in January. For this spread trade, our results show little 
or no anticipation in the contracts last two years of existence. This was, most likely, 
related to the scarce liquidity of the contract in its final months. 

For the Value Line 100 minus S&P500 spread there is a positive anticipation dur- 
ing the second half of December to the TOY period. Further from this point, the 
results are very mixed, with an apparent dominance of positive anticipation, but 
with a clear trend towards no anticipation, especially in the 2003-2004 TOY, where 
there was little or no anticipation during the first half of December, and no antici- 
pation at all in the rest of the analyzed period. This is, again, consistent with the 
diminishing liquidity of the Value Line 100 contract, which reached extreme values 
in the 2004-2005 TOY, with an average of less than ten contracts traded daily and 
an average open interest of less than one hundred contracts. 

For the Russell 2000 minus S&P500 spread, positive anticipation dominates in 
all the time intervals for which we divided the TOY period. In the three TOYs, 
2002-2003, 2003-2004, and 2004-2005, negative anticipation dominated, but its 
size is not significant. Since 2000, the pattern seems to be changing from positive 
anticipation throughout the period to none or insignificant negative anticipation. 

For the Value Line 500 minus S&P500 spread, the three TOYs up to the ter- 
mination of this higher-multiplier contract in March 2000, showed a reversion to 
the original January effect. The declining liquidity of the contract up to its date of 
termination is clearly a factor in this reversal. This made the trade very risky and 
volatile. Large caps had the advantage in the second half of December and January, 
but small caps outperformed large caps during the (—1) to January 15 period. On 
average, the Clark and Ziemba (1987) rule would have yielded profits over the entire 
1982/1983 to 1999/2000 sample period, although this result is not statistically sig- 
nificant. This trade was successful in all but two of the TOYs in this sample. The 
trade produced statistically significant profits in the 14 TOYs from 1982-1983 to 
1995-1996, with a mean gain of 4.25 spread points, standard deviation of 2.81, a t 
value of 1.51 and 14 of 14 winners. The results are not statistically significant in the 
complete sample period because of losses in the 1996-1997 and 1997-1998 TOYs 
and high variability. 

For the Value Line 100 minus S&P500 spread, the results show that there was 
still a small cap advantage in the second half of December, followed by a large cap 
advantage in January. On average, this trade continued to be profitable since it 
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produced positive profits in all but one of the 7 TOYs studied. However the results 
were not statistically significant because of high variability. Moreover, as pointed 
out by Hensel and Ziemba (2000), the trade became increasingly riskier and volatile 
as the liquidity of the Value Line futures contract diminished. 

The Russell 2000 minus S&P500 trade is very interesting. There seems to be a 
clear pattern of anticipation of the January effect on this spread. The results show 
a definite large cap advantage in the second half of January, as it was in the earlier 
Value Line minus S&P500 studies. For the rest of the sub periods, the results are 
as follows: Out of 12 TOYs, large caps had the advantage in eight years during 
the (—1) to January 15 period. On the other hand, small caps had an advantage 
in 9 out of the 12 TOYs for the December 15 to (—1) period. On average, the 
Clark and Ziemba (1987) rule produced positive profits, although this result is not 
statistically significant. A special case is the December 15 to (—3) period illustrated 
in Table 1.1. In this case, the spread trade produces profits that are close to being 
statistically significant at the 10% level. This result suggests that the Hensel and 
Ziemba (2000) modification to the Clark and Ziemba (1987) rule for the Value Line 
vs. S&P500 spread trade should be modified so as to unwind the position at the (—3) 
day of the TOY. The 1999-2000 and 2000-2001 TOYs in the Russell 2000 sample 
were particularly strong, and introduce high variability in the sample. Tables 1.1 
and 1.2 show the mean gain, standard deviation and t statistic for the spread trade 


Table 1.1: Results From Russell 2000/S&P500 March Futures Spread Trades in Index Points on 
Various Buy/Sell Dates for the 18 Turn-of-the-Years 1993-1994 to 2010-2011. 


Difference Difference Difference Trade Gain Trade 
Dec 15 (—1) to Jan 15 to Dec 15 to Weights 

TOY to (—3) Jan 15 end Jan Jan 15 S&P vs. R2 
1993-1994 3.39 (2.22) 1.84 1.17 0.52 v 0.48 
1994-1995 8.03 (3.93) (5.49) 4.10 0.52 v 0.48 
1995-1996 5.26 (7.68) (2.96) (2.42) 0.51 v 0.49 
1996-1997 2.84 (6.02) (7.20) (3.18) 0.51 v 0.49 
1997-1998 6.12 (8.47) (6.52) (2.35) 0.53 v 0.47 
1998-1999 12.75 2.78 (15.33) 15.53 0.59 v 0.41 
1999-2000 23.85 9.17 12.30 33.02 0.61 v 0.39 
2000-2001 22.94 8.07 8.98 31.01 0.59 v 0.41 
2001-2002 2.92 (3.29) 5.15 (0.37) 0.45 v 0.55 
2002-2003 2.78 (6.27) 5.72 (3.49) 0.47 v 0.53 
2003-2004 2.17 17.69 (4.30) 19.86 0.51 v 0.49 
2004-2005 0.57 (19.02 8.12 (18.45) 0.52 v 0.48 
2005-2006 2.72 11.99 28.86 14.71 0.52 v 0.48 
2006-2007 0.36 0.13 1.39 0.23 0.53 v 0.47 
2007-2008 16.48 (24.66 17.77 (8.18) 0.51 v 0.49 
2008-2009 28.18 6.85 (8.62) 21.33 0.51 v 0.49 
2009-2010 14.64 1.20 (2.66) 15.84 0.51 v 0.49 
2010-2011 1.91 1.04 (19.15) 2.95 0.55 v 0.45 
Average 8.77 2.03 0.99 6.74 

StDev 8.86 10.28 11.70 13.85 

t stat 0.99 0.20 0.08 0.49 
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Table 1.2: Results From VL100/S&P500 March Futures Spread Trades in Index Points on Var- 
ious Buy/Sell Dates for the 19 Turn-of-the- Years 1982-1983 to 1999-2000. 


Difference Difference Difference Trade Gain Trade 

Dec 15 (—1) to Jan 15 to Dec 15 to Weights 
TOY to (—1) Jan 15 end Jan Jan 15 S&P vs. VL 
1982-1983 1.50 3.05 0.65 4.55 0.5 v 0.5 
1983-1984 (0.70) 4.00 (4.90) 3.30 0.5 v 0.5 
1984-1985 1.10 3.55 2.90 4.65 0.5 v 0.5 
1985-1986 3.15 0.45 (2.60) 3.60 0.5 v 0.5 
1986-1987 2.75 (0.30) (9.85) 2.45 0.5 v 0.5 
1987-1988 8.15 (0.90) (0.25) 7.25 0.5 v 0.5 
1988-1989 3.50 (2.95) (1.70) 0.55 0.5 v 0.5 
1989-1990 (0.50) 1.85 (1.45) 1.35 0.5 v 0.5 
1990-1991 1.70 3.60 3.20 5.30 0.5 v 0.5 
1991-1992 (7.15) 10.20 13.80 3.05 0.5 v 0.5 
1992-1993 5.45 6.55 4.05 12.00 0.5 v 0.5 
1993-1994 4.65 a 1.15 4.65 0.5 v 0.5 
1994-1995 6.15 (1.65) (8.50) 4.50 0.5 v 0.5 
1995-1996 6.00 (3.75) (9.70) 2.25 0.5 v 0.5 
1996-1997 4.35 (15.90) (10.75) (11.55) 0.5 v 0.5 
1997—1998 10.70 (6.50) (12.30) 4.20 0.5 v 0.5 
1997-1998 1.01 (6.40) (6.42 (5.39) 0.64 v 0.36 
1998-1999 0.91 2.17 (28.19 3.08 0.60 v 0.40 
1999-2000 6.09 14.00 7.33 20.08 0.59 v 0.41 
Average 
1982-1998 3.18 0.08 (2.27 3.26 
Std Dev 4.10 5.90 6.89 4.74 
t stat 0.78 0.01 (0.33 0.69 
Average 
1998-2000** 2.67 3.26 (9.09 5.93 
Std Dev 2.96 10.24 17.91 12.97 
t stat 0.90 0.32 (0.51 0.46 


excluding these years. The January effect could be exploitable in the futures markets 
through the Russell 2000 minus S&P500 spread trade, which presents the additional 
advantage of greater liquidity, when compared to the Value Line vs S&P500 spread 
trade. There are periods for which the Value Line minus S&P500 trade had gains 
and the Russell 2000 minus S&P500 had losses. Although this is not the case in 
the years after 2000, when the trades have started to look very similar, it is not 
contradictory because the contracts are made up of different stocks, and the Russell 
2000 contract has a much higher liquidity. 

Tables 1.2 and 1.4 take data from Hensel and Ziemba (2000) and update the 
results for the Clark and Ziemba (1987) rule for the Value Line 500 minus S&P500 
and Value Line 100 minus S&P500 spread trades. All fourteen of the TOY trades 
from 1982-1983 to 1995-1996 showed profits. Figures 1.5a—h show the anticipa- 
tion, etc. But because of declining Value Line volume and consulting to Morgan 
Stanley in New York where he taught the Peter Muller group about the TOY and 
other trades, that group became very successful and had gains in the $5 billion 
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Table 1.3: Results From Russell 2000/S&P500 March Futures Spread Trades in Index Points 
on Various Buy/Sell Dates for the 12 Turn-of-the-Years 1993-1994 to 2004-2005. 


Difference Difference Difference Trade Gain Trade 
Dec 15 (—1) to Jan 15 to Dec 15 to Weights 

TOY to (—1) Jan 15 end Jan Jan 15 S&P vs. R2 
1993-1994 4.23 (2.22) 1.84 2.01 0.52 v 0.48 
1994-1995 5.84 (3.93) (5.49) 1.91 0.52 v 0.48 
1995-1996 5.28 (7.68) (2.96) (2.39) 0.51 v 0.49 
1996-1997 (1.63) (6.02) (7.20) (7.65) 0.51 v 0.49 
1997—1998 7.43 (8.47) (6.52) (1.04) 0.53 v 0.47 
1998-1999 9.74 2.78 (15.33) 12.51 0.59 v 0.41 
1999-2000 27.66 9.17 12.30 36.83 0.61 v 0.39 
2000-2001 27.10 8.07 8.98 35.17 0.59 v 0.41 
2001-2002 7.84 (3.29) 5.15 4.55 0.45 v 0.55 
2002-2003 (0.93) 6.27) 5.72 (7.20) 0.47 v 0.53 
2003-2004 1.06 17.69 (4.30) 18.76 0.51 v 0.49 
2004-2005 (0.34) (19.02 8.12 (19.36) 0.52 v 0.48 
Average 7.77 1.60 0.03 6.17 

Std Dev 9.86 9.72 8.24 16.96 

t stat 0.79 0.16 0.00 0.36 

without 2000 and 2001 

Average 3.85 3.64 (2.10) 0.21 

Std Dev 4.05 9.35 7.24 10.69 

t stat 0.95 0.39 (0.29) 0.02 


Table 1.4: Results From VL100/S&P500 March Futures Spread on Various Buy/Sell Dates for 
the 7 Turn-of-the- Years 1998-1999 to 2004-2005. 


Difference Difference Difference Trade Gain Trade 
Dec 15 (—1) to Jan 15 to Dec 15 to Weights 

TOY to (—1) Jan 15 end Jan Jan 15 S&P vs. VL 
1998-1999 0.35 1.15 (11.26) 1.50 0.23 v 0.77 
1999-2000 2.29 5.26 2.76 7.55 0.22 v 0.78 
2000-2001 8.68 15.81 7.72 24.49 0.24 v 0.76 
2001-2002 6.00 (2.13) 4.62 3.87 0.29 v 0.71 
2002-2003 0.66 (0.11) 0.50 0.54 0.32 v 0.68 
2003-2004 1.45 7.16 (0.87) 8.61 0.36 v 0.64 
2004-2005 4.42 (11.20) 6.31 (6.78) 0.37 v 0.63 
Average 3.41 2.28 1.40 5.68 

Std Dev 3.10 8.41 6.36 9.73 

t stat 1.10 0.27 0.22 0.58 


area (see Patterson, 2010). Ziemba stayed out of these markets until the 2009-2010, 
2010-2011 and 2011-2012 TOYs with the Russell 2000 mini futures. Figure 1.6 
shows these three trades, the round dots denote the entry and the squares the 
exit in our test anomaly account run by the authors; see Figure 1.29. So the old 
Clark-Ziemba trade modified seems to still work. Of course, there are lots of imple- 
mentation issues. 
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Figure 1.6: Russell 2000 — S&P500 Spread With Our Entries (Dots) and Exits (Squares). 
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1.3 The January Barometer 


Historically, returns in January have been a valuable signal for the returns in the 
following eleven months that year. If stocks have positive returns in January, then 
it is likely that the market as a whole will rise in that year. Hirsch (1986), who first 
mentioned this in 1972, has called this the January barometer. In the yearly updated 
Stock Trader’s Almanac by Jeffrey A. Hirsch and Yale Hirsch (2011), they define it 
as the full year rather than the last eleven months. We look at this both ways, full 
year and last eleven months, for the US S&P500 in Figure 1.8. The supposition is 
that: 


If the market rises in January, then it will rise for the year as a whole; but if it falls 
in January, then there will be a decline or a flat market that year. 


Figure 1.7 updates Hensel and Ziemba (1995a) and Ziemba (1994) which had the 
results for the 54 years 1940-1993. There are 72 years in the total sample with an 
18 year update to the end of December 2011. 

For the 72 years, when the return in January was positive, the rest of the year 
was up 84.4% of the time. This compares with 69.4% of all the years that the whole 
year was up. 

When the return in January was negative which was 27 of the 72 years, the 
rest of the year was down 48.1% of the time. Thus even in years when January 
is down, the whole year is about equally likely to be up or down. This 48.1% is 
significantly less than the 72.2% of all the years that the rest of the year went up. 
Figure 1.7 also shows the full year return for the four cases with arithmetic and 
geometric mean returns. We conclude that the January barometer does add value 
and is useful in various ways. Negative Januarys like 2008 had good predictive value. 
But the measure is not infallible. For example, 2010 had positive 11 month and 12 
month returns despite a negative January. But as in other cases of negative January 
but positive 11 and 12 month returns, those returns are, on average, small. 

In the 18 year update (1994-2011), the results as seen in Figure 1.7 are similar 
with the January up ROY up 72.7% (8 of 11) of the time. 

Figure 1.8(a) shows the cumulative rest of year returns for positive January, 
negative January and buy and hold. Historical buy and hold beats positive January 
and has the highest final wealth with negative Januarys producing almost no gains 
at all. Buy and Hold had returns that were high except the 2007-2009 drop in the 
S&P500 led to the positive January dominating. Figure 1.8(b) has the full year 
results. 

The equations for Figures 1.9 and 1.10 are: 


ROY = 0.0591 +0.794Jan R? = 6.5% 
(3.41) (2.21) 

ROY, = 0.1298 —0.324Jany R? = 0.7% 
(4.42)  (—0.55) 

ROY _ = —0.0840 —1.936Jan_ R? = 7.2% 
(—1.36)  (—1.39) 
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11 month 12 month 
mean return mean return 
arithmetic arithmetric 
geometric geometric 
38 years (84.4%) 15.1% 19.6% 
ROY up 14.8% 19.2% 
45 years January 
up 
7years (15.6%) -9.4% -5.4% 
ROY down 9.6% -5.6% 
72 years total sample 
1940-2011 
14years (51.9%) 11.2% 6.3% 
ROY up 10.8% 5.9% 
27 years January 
down 
13years (48.1%) -13.9% -16.7% 
ROY down -14.4% -17.2% 
(a) Total Sample 
11 month 12 month 
mean return mean return 
arithmetic arithmetric 
geometric geometric 
8years (72.7%) 16.4% 19.7% 
ROY up 16.0% 19.3% 
11 years January 
up 
3 years (27.3%) -7.6% -4.8% 
ROY down -7.8% 5.0% 
18 years update 
1994-2011 
4years (57.1%) 21.9% 16.4% 
ROY up 21.4% 16.0% 
7 years January 
down 
3 years (42.9%) -20.6% -24.0% 
ROY down -21.5% 24.9% 


(b) Update 


Figure 1.7: January Barometer Results, 1940-2011 and 1994-2011. 


Bronson (2011) reminds us that the January barometer has had six false positives 
since 1940, where January was up but the rest of the year was negative. In 1947, 
there were enough dividends and January returns to overturn this loss for the whole 
year. So that leaves the following five January positive net return of the year negative 
returns (not including dividends) as in Table 1.5. 

There have also been 14 false negatives since 1940 to 2010 where January was 
negative but the rest of the year is positive. We differ from Bronson by simply saying 
that if January is negative, the rest of the year is noise. So Bronson argues that the 
January barometer has failed to signal the direction of the stock market 19 of the 
past 71 years up to 2010, some 27% of the time. Agreeing with us, the stock market 
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Figure 1.8: Positive and Negative January and B&H Cumulative Returns. S&P500 Index (Cash), 


1940-2011. 


was up 73% of the time (52 of 71 years). But Bronson argues that the barometer 
is getting less accurate recently. Indeed 12 of the 19 failures (60%) have occurred 
in the 32 years since 1978. Figure 1.11 shows Bronson’s graph relating the rest of 
the year percent change as a function of January’s percent change. His regression 
suggests that the rest of the year percent change equals 6% plus 80% of January’s 
percent change. Compare this with our regression above with a minuscule 0.5% R? 
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Figure 1.9: January Return (x-axis) vs. Rest of Year Return (y-axis). S&P500 Index (Cash), 
1940-2011. 
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Figure 1.10: January Return (x-axis) (Up and Down Cases) vs. Rest of Year Return (y-axis). 
S&P500 Index (Cash), 1940-2011. 


and a rest of the year return of 12.6% minus 0.27 times January’s return. His low 
7% R? exceeds ours. He concludes that January 2011’s gain of 2.3% yields a forecast 
of a rest of 2011 gain of 7.8%. This gain is quite close to those we hear from the 
TV forecasters. 

Hirsch and Hirsch (2011) and Ziemba (2010) discuss this first five day of January 
predictor with data from 1950-2010. The last 37 positive first five days were followed 
by full year gains 32 times (86.5%) and a mean gain of 14.0% for the 37 years. The 
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Table 1.5: Returns for Positive 
January, Negative Rest of Year, %. 


Year January ROY Year 


1946 7.0 —17.6 —11.9 
1947 2.4 —2.3 0.0 
1966 0.5 —13.5 —13.1 
1987 13.2 —9.9 2.0 
1994 3.3 —4.6 —1.5 
2001 3.5 —16.0 —13.0 


Source: Bronson (2011) 


Regression Analysis of January (x-axis) with Rest of Year (y-axis) 
Rest-of-year % change equals +6.0% plus 80% of January's % change witha very The Jan"11 gain of 2.3% suggest the rest of 2011 will 


i E 9; = 0; 2-79 i likely gain 7.8%, but the very high standard deviation of 
high standard deviation of +/-14.9%, r= +26% and r? = 7%, which are very low 14.9% suggests the odds are two-thirds that the restof 
m year will come in between -7% and +22%, and one-third 
The 2nd- thru 6th-degree polynomial best fits are also graphed (maximum r? = 32%), illustrating odds that it will be between -22% and +37%. 
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Figure 1.11: Regression Analysis of January Return With Rest of Year. 


Source: Bronson (2011) 


23 negative first five days had 12 positive (52.2%) years and 11 negative years. The 
full month January barometer has been an even better predictor except in 2009 so 
we have focused on that but the five days is important to look at. To conclude, the 
2008 signal was the strongest at —5.3%, the negative first five days were the worst 
ever, and the negative January led to the very devastating 2008 with a yearly loss 
of —38.5% for the S&P500. The first five days of 2012 was a positive period. 

The results we have found are supplemented with other studies as follows. 

Brown and Luo (2006) consider the performance of the January barometer 
(JanB) in the U.S. from 1941-2003 and find it has predictive ability. More recently, 
Stivers, Sun, and Sun (2009) find, using the simple spread approach, the power of 
the JanB in U.S. indices has declined since it was published in the early 1970s but 
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that it remains a useful market timing technique in the 1975-2006 period. Addi- 
tionally, Sturm (2009) shows the JanB is particularly powerful in the first year of 
the presidential cycle. 

Cooper, McConnell and Ovtchinnikov (2006) focus on the 1940-2006 period and 
consider the robustness of results using NYSE data dating back to 1825. They call 
the effect the “other January effect” but we prefer January barometer. In addition 
to testing the JanB with the full market index, Cooper et al. also find it has predic- 
tive value for both small and large stocks and value and growth stocks. The effect 
persists after adjustment for business cycle and macroeconomic variables, investor 
sentiment, and the presidential cycle. 

Cooper et al. found that over the previous 147 years, the spread between the 
11-month return following positive versus negative January’s was 7.76%, and other 
papers have reported spreads of 10%+. Though consensus exists around this con- 
clusion that January returns have a predictive power, the consensus dissipates at 
the crucial point: Can you profit from it? 


1.3.1 How to trade the January Barometer (JanB) 


The apparent predictive power of the JanB leads us to if and how investors should 
trade to profit by it. The following strategies have been analyzed as ways to use the 
JanB to outperform a passive buy-and-hold strategy (see also Figure 1.8): 


1. Standard JanB strategy: Stay out of the market in January and go long or short 
for the remainder of the year depending on if the January return is positive or 
negative. 

2. JE+JanB strategy: Go long in January based on the original January effect 
(because market returns in January are positive on average) and, for the remain- 
der of the year, follow the standard JanB strategy. 

3. JE + JanB T-bill strategy: Go long in January based on the original January 
effect. If the January return is positive, go long. If the January return is negative, 
invest in t-bills. 


Regarding the standard JanB strategy, Marshall et al. and Cooper et al. came to 
a similar conclusion that a passive buy-and-hold strategy beats the standard JanB 
strategy. This result is unsurprising given that this strategy calls for staying out of 
the market in January to wait for the market signal before investing from February 
through December. By skipping January, the investor misses the excess returns 
experienced in January as documented as the original January effect. 

Therefore the two subsequent strategy alternatives both assume the investor 
goes long in January in order to benefit from the January effect. 

The JE+JanB strategy also underperforms a simple buy-and-hold strategy. 
Cooper et al. report that, while it was much lower than the return after a positive 
January, the average yearly return after negative Januarys was also positive at 
5.71%, so a short strategy in these periods earns less than the ¢-bills or the buy- 
and-hold strategy. Cooper et al. also point out the substantial losses from tail risk in 
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very good years occurring after negative Januarys, when shorting the market would 
have been disastrous for the investor. 

Both Cooper et al. and Marshall et al. find the JE + JanB T-bill strategy to be 
the best of the three above alternatives, but they disagree on whether this strategy 
is superior to the buy-and-hold baseline. 

For example, during the period from 1940 to 2006, Cooper et al. report average 
annual buy-and-hold returns of 11.94% compared to a yearly average of 12.79% for 
the JE+ JanB T-bill strategy. Cooper et al. conclude that following this strategy is 
of value to investors based on the past data. 

Marshall et al. reach the opposite conclusion through the same data, finding 
that the JanB cannot be used by investors to outperform a passive long strategy. 
For the period from 1940 to 2007, they find average yearly buy-and-hold returns 
of 12.68% and JE + JanB T-bill strategy returns of 13.09%. Marshall et al. say 
the discrepancies between the two conclusions are based on dissimilar statistical 
significance and risk assessment calculations as well as differing opinions of the 
economic significance of < 0.05% difference in annual return, which excludes transac- 
tion costs and uses simple spreads that they find biased away from investors trading 
realities. 

The only way the JE+ JanB T-bill strategy differs from the buy-and-hold strat- 
egy is that, following Januarys with negative returns, the investor opts for T-bills 
instead of equity. The former strategy does not outperform the buy-and-hold strat- 
egy because, following Januarys with negative returns, average 11-month T-bill 
returns are only marginally larger than average 11-month equity returns. 


1.3.2 The international January Barometer 


Hensel and Ziemba (1995b), Easton and Pinder (2007), and Stivers, Sun, and Sun 
(2009) address the performance of the JanB in international markets. Hensel and 
Ziemba find similar results in Switzerland and Europe and global as the U.S. 
Namely, about 85% positive years and rest of years following positive Januarys 
and noise about 50-50 following negative Januarys. The mean returns have the 
same basic behavior, being more favorable for positive than for negative Januarys. 
So positive Januarys do seem to have positive predictive power. 

Bohl and Salm (2010) study the predictive power of stock market returns in 
January for the rest of the year for 19 countries. They find that the barometer works 
well in the U.S., as we know, and in Norway and Switzerland. But it did not predict 
well in the other 16 countries which included Japan, France, Spain and Germany. 

The data periods vary by country but are long, for example, Australia 1903-2007, 
Austria 1970-2007, Belgium 1951-2007, Canada 1936-2007, and France 1896-2007. 
In many cases it is the high sigma leading to too low t’s which caused the significant 
non-predictability. But in some cases, the signal is actually going the wrong way 
in their regression model. So care is needed in these various countries to use the 
barometer for added value. 
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1.4 Sell-in-May-and-go-away 


September and October have historically had low stock market returns with many 
crashes occurring in October. November to February have historically had higher 
than average returns; see Gultekin and Gultekin (1983), Keim and Ziemba (2000), 
and this book. This suggests the strategy to avoid the bad months and be in cash 
then and only be long the stock market in the good months. Sell-in-May-and-go- 
away, also called the Halloween effect, is one such strategy. Figures 1.12 and 1.13 
show this strategy using the rule sell on the first trading day in May and buy on 
the 6th trading day before the end of October, for the S&P500 and Russell 2000 
futures indices for the years 1993-2011, respectively. This rule did beat a buy and 
hold strategy. Appendix Tables A.2 and A.3 show the monthly returns, respectively, 
for those 18 years. Tables A.6 and A.7 have the SIM versus buy and hold data from 
Figures 1.12 and 1.13. 

For the S&P500 a buy and hold strategy turns $1 on February 4, 1993, into 
$1.91 on December 31, 2011; whereas, sell in May and move into cash, counting 
interest (Fed funds effective monthly rate for sell in May) and dividends for the buy 
and hold, had a final wealth of $4.03, some 111.6% higher. For the Russell 2000, the 
final wealths were $1.83 and $5.35, respectively, some 192.3% higher. Historically, 
as shown below in the Presidential effects discussion, the SIM rule is not as reliable 
in Presidential election years but it did work in 2008. There are many discussions 
of the SIM rule with differing data years and differing entry and exit rules. Some 
suggest scrapping the rule in 2012, an election year. But with a big stock run up 
based on cheap Fed money and huge problems in Europe and elsewhere, being out 
of the market, in cash, seemed wise. Indeed, when we went to press on Monday May 
28, 2012, the S&P500 was down over 5% in May. 
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Figure 1.12: S&P500 Futures SIM and B&H Cumulative Returns Comparison 1993-2011 (Entry 
at Close on 6*? Day Before End of October; Exit 18* Day of May). 
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Figure 1.13: Russell 2000 Futures Sell in May (SIM) and B&H Cumulative Returns Comparison 
1993-2011 (Entry at Close on 6t! Day Before End of October; Exit 15% Day of May). 


Bouman and Jacobsen (2002) confirm that the SIM effect holds in 36 of the 37 
countries Vacation timing is a potential cause of the effect, suggesting the timing 
of summer vacations may cause temporal variations in appetites for risk aversion. 
However, they find evidence of the effect in their subset of Southern hemisphere 
countries, which under their hypothesis would be expected to have a different sea- 
sonal pattern. 

Seasonal Affective Disorder (SAD) was studied in Kamstra, Kramer, and Levi 
(2003) and Garret, Kamstra and Kramer (2004). SAD is a disorder in which the 
shorter, relatively sunless days of fall and winter cause depression, which recent 
research links to an unwillingness to take risk. Kamstra (2003) concludes that the 
SAD explanation does not lead to a profitable trading strategy because the risk pre- 
mium varies with the seasonal effects. Like the vacation timing hypothesis, Doeswijk 
finds the SAD hypothesis insufficient because SAD is known to start as early as 
September so the historically high November returns cannot be explained. 

Doeswijk (2005) posits that, in the fourth quarter of each year, investors are 
overly optimistic about the upcoming year. This optimism leads to attractive initial 
returns followed by a renewed realism that readjusts expectations. Unlike the SAD 
hypothesis, which suggests a varying risk premium, the Optimism Cycle hypothesis 
reflects a constant risk premium with a varying perception of the economic outlook. 
To test this hypothesis, Doeswijk ran three analyses: 1) the global zero-investment 
seasonal sector-rotation strategy; 2) the seasonality of earnings growth revisions; 
and 3) the initial returns of IPOs as a proxy for investor optimism. 

If this hypothesis is correct, a winning investment strategy is going long in 
cyclical stocks and short in defensive stocks during November through April (winter) 
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and following the opposite strategy from May through October (summer). (These 
stock groups are chosen for their relative exposure to the general economy, with 
cyclical stocks having a high exposure and defensive stocks a low exposure.) To test 
this strategy, Doeswijk uses the MSCI World index of global stock returns from 
1970-2003 and tests the data as a whole, in two 17-year sub-periods, using several 
variations on timing of the winter period, and various sector definitions. The study 
runs regressions using monthly market capitalization-weighted price return indices 
and their monthly log returns. 

Doeswijk finds that, on average during the study period, winter returns are 
a significant 7.6% higher than summer returns and the strategy works in 65% of 
the years. On a monthly basis, average performance of the global zero-investment 
strategy is 0.56%, which is significant at the 1% confidence level. Using further 
regression analysis techniques, Doeswijk also isolates the market timing effects from 
the seasonality and finds that seasonality alone accounts for approximately half of 
the excess returns. 

Both analyses by Doeswijk support the Optimism Cycle hypothesis. Expected 
earnings growth rates follow a seasonal cycle and that these changes have an effect 
on stock performance. The third analysis uses initial IPO returns, which show 
a remarkable seasonality, as a proxy for investor confidence. Using this investor 
confidence proxy as an independent variable, the regression result for remaining 
excess return is not statistically significant, which supports the Optimism Cycle 
hypothesis. 

Along with the three supporting analyses, Doeswijk explains a qualitative argu- 
ment in favor of his Optimism Cycle hypothesis. He argues that, since this phe- 
nomenon is one based on an aspect of human psychology, it tricks investors into 
repeating the same biases every year. Importantly, this cycle of optimism and pes- 
simism is not generally accepted, which Doeswijk argues allows for investors who 
understand it to profit from it as a free lunch until it is more widely accepted and 
the arbitrage opportunity is absorbed into the market. 


1.4.1 Same month next year 


Heston and Sadka (2007) review predictability models for average stock returns 
based on past returns, such as short-term and long-term momentum and reversal 
effects. Several studies confirm various predictability patterns, but several potential 
explanations of these patterns exist, including data snooping, risk compensation, 
or behavioral theories. By expanding the analysis outside of the U.S., Heston et al. 
reduces the potential bias of data snooping and uncovers new information about the 
relative applicability of predictability models in different countries. The paper ana- 
lyzes twelve European countries plus Canada and Japan using a dataset of monthly 
returns from 1985 to 2006. 

Heston and Sadka analyze monthly stock returns using cross-sectional regres- 
sions. They begin their analysis by analyzing various lagging return variables 
to check for various existing temporal prediction patterns, such as short-term 
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momentum effects and medium-term reversal effects. They confirm that the 
momentum effect holds in their international sample, with the exception of medium- 
term momentum effects not being present in Japan. They also uncover a new pat- 
tern, a one-year lag with a reversal effect in the intervening months. Additionally, 
the study expands upon the positive return continuation of one-year lags by check- 
ing on lags of every 12 months for up to 120 months. They find that the positive 
returns are present in these longer term yearly lag periods, though some results 
in longer time frames were not statistically significant due to insufficiently large 
sample sizes in some cases. 

Heston and Sadka also analyze potential portfolio strategies to make use of 
this uncovered pattern. They use the decile spread methodology of Jegadeesh 
and Titman (1993) to investigate various time horizons and sorting methods and 
additionally analyze calendar effects, such as the January effect. They analyze if 
return patterns according to liquidity measures and by country and find that nei- 
ther explains the temporal pattern. They also consider if the temporal pattern 
is a result of common international risk factors, finding that different countries 
are correlated in the short term but over longer time horizons the correlation 
is, while still positive, notably weaker and sometimes statistically insignificant. 
They conclude that international diversification around this strategy is benefi- 
cial and reason that capital market segmentation may leave rewards for seasonal 
risks specific to different countries or that in different countries seasonal news 
may cause relatively predictable behavioral responses that are reflected in return 
movements. 


1.5 Holiday Effects 


There has been a very strong holiday effect in U.S. markets throughout the twentieth 
century. Ariel (1990), Zweig (1986), Lakonishok and Smidt (1988) have documented 
this. For example, for the ninety years from 1897 to 1986, Lakonishok and Smidt 
found that fully 51.5% of the non-dividend returns on the Dow Jones industrials 
were made on the approximately eight yearly preholidays. Ariel using data from 
1963-1982 found a very strong effect with the average preholiday having returns 
that were about 23 times an average day for large capitalized stocks measured by a 
value weighted index of all NYSE stocks. Small capitalized stocks (equally weighted 
NYSE stocks) had returns 14 times larger but since this period was one of extremely 
high small stock returns, the actual returns exceeded the large capitalized securities. 
Lakonishok and Smidt also found that preholidays were associated with higher mean 
returns on all days of the week compared to average returns those days. Investigation 
of the holiday effect in Japan by Ziemba (1991) yielded very similar results. Using 
daily data on the Nikkei stock average from May 1949 when the market opened 
up after World War II to 1988, he found that the typical preholiday had returns 
of about five times the average non preholiday trading day, namely 0.246% versus 
0.0489%. 
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Holiday effects in other countries are discussed by Ziemba (1994) and in the 
survey by Cervera and Keim (2000) and other papers in the Keim and Ziemba 
(2000) volume. 

Historically it was the preholiday that had the highest returns with the —3 day 
having the next highest returns, with the day after the holiday having negative 
returns. 

A regression to separate out the effects on trading days —3 to +2 around a 
holiday using 0,1 variables led to the daily return 


R = 0.0352 +0.0799Day_3 + 0.0222Day_2 +0.1894Day_1 — 0.0663Day +1 + 0.00114Day+2 
(3.745) (1.491) (0.424) (3.709) (—1.334) (0.023) 


Observe the high positive coefficient and t-value on the —1 day. The evidence we 
have to update this to 2010 is from the futures markets from 1993-2010 and it is that 
the effect seems to have moved to the —3 day before the holiday and is much weaker 
than in the past. The pre-holiday is marginally positive for both the S&P500 and 
Russell 2000. Labor day for the Russell 2000 is the most reliable with a mean gain of 
0.88% with a t = 4.81 and 15 of 16 positive. Presidents’ day was also reliable 82.4% 
of the time with a t = 2.11. Since none of the holidays were highly significantly 
positive for the S&P500 and, except for these two, the Russell 2000 results were 
marginal, we conclude that the holiday effect exists to some extent on the —3 day 
but has diminished greatly in the 1990s and 2000s. The mean gain for the S&P500 
was 0.19% (t = 1.74) and the Russell 2000 was 0.26% (t = 2.14). 

Table 1.6 documents the overall results for these two indices on days —3 to +2 
plus other days and all the days. 

Figure 1.14 shows the holiday average returns per day for the S&P500 on the 
days —3 to +2 from 1993-2010. This shows the strong —3 day. Figure 1.15 has this 
by holiday. Figures 1.16 and 1.17 have the results for the Russell 2000. 


Table 1.6: Futures Holiday Average Returns by Day, 1993-2011.* 


Pre H Pre H Pre H After H After H 

S&P500 All Days —3 —2 -1 1 2 Others 
Count 4784 151 151 151 151 151 3910 
St Dev 0.0126 0.0122 0.0099 0.0107 0.0128 0.0143 0.0127 
Average 0.02% 0.19% —0.03% 0.03% 0.06% 0.09% 0.01% 
z 1.1697 1.9549 —0.3156 0.3991 0.5748 0.7871 0.5517 
Positive 53.1% 57.0% 53.0% 51.0% 53.6% 57.6% 52.8% 
Russell 2000 

Count 4791 152 152 152 152 152 3911 
St Dev 0.0151 0.0153 0.0121 0.0133 0.0150 0.0186 0.0151 
Average 0.02% 0.31% 0.00% 0.06% 0.09% 0.09% 0.00% 
z 1.0982 2.4735 —0.0096 0.5802 0.7114 0.6176 0.0536 
Positive 53.2% 63.8% 53.3% 53.9% 50.0% 57.9% 52.7% 


*For some years Xmas 2 AH= New Year —3PH 
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Figure 1.14: S&P500 Futures Average Returns by Day (—3, —2, —1, 1,2) by Day 1993-2011. 
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Figure 1.15: S&P500 Futures Holiday Average Returns by Holiday, 1993-2011. 


1.5.1 The sell on Rosh Hashanah and buy on Yom Kippur anomaly 


This Rosh Hashanah anomaly relates to the Jewish new year. In 2009, that was 
close of September 18 to open on September 28 for the trading days. The origin of 
this practice seems to be the belief of Jewish investors that they should liquidate 
their portfolios during the holiday so that their attentions could be fully focused on 
their worship; or more likely in today’s world, not trade. Could this really be true 
in 2009, 2010 and 2011? 
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Figure 1.16: Russell 2000 Futures Average Returns by Day (—3, —2, —1, 1, 2) by Day, 1993-2011. 
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Figure 1.17: Russell 2000 Futures Holiday Average Returns by Holiday, 1993-2011. 


Going back to 1915, the performance of the DJIA is —0.62% from the last close 
before Rosh Hashanah until eight days later with the last close before Yom Kippur. 
From then to December 31 averaged a respectable 1.99%, see TheStreet.com and 


the Kirk 


Report. 


What happened in 2009? The close on September 18 was 1068.30 and the close 
on September 25 was 1044.38, so —2.24%. So the anomaly worked once again. 
What happened in 2010? 
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Rosh Hashanah for Jewish Year 5771 occurred on sunset Wednesday, 
September 8, 2010—nightfall September 10, 2010 and Yom Kippur began on 
Friday, September 17, 2010. The close on September 8 was 1098.87 and the close 
on September 16 was 1124.66, for a gain of 2.35%. So the anomaly did not work in 
2010. The close on December 31, 2010 was 1257.64 or a gain of 11.82% from the 
close on September 16, 2010. 

Rosh Hashanah 2011 began on September 28 and the S&P500 cash close then 
was 1151.06. Yom Kippur 2011 began on October 7 with the S&P500 close at 
1164.97. The change was +1.2%. So the anomaly failed again. 

Returning to September—October. Figures 1.3ab and 1.4ab show the monthly 
effect in the S&P500 large cap and the Russell 2000 small cap future indices, 
respectively, from 1993-2011 and 2004-2011. Observe that over the longer hori- 
zon October is actually positive and September is slightly negative and there is no 
reliable monthly effect. As the historically strong months of January and February 
are negative, December is reliably positive though. 

More recently, from 2004-2011, October is massively negative except it was up 
10.5% in October 2011 and November, which historically has the strongest TOM is 
also negative. The year by year September—October returns from 1993 to 2010 are 
in Table 1.7. The pattern is clear: these months are like other months except that 
they frequently have big declines —8.38% (2001), 11.31% (2002) and —9.79% (2008) 
for September and the —20.11% (2008) for October for the S&P500. October had 
other great falls in 1929, 1987 and other years. However, over the years 1998-2010 
both September and October were positive on average. The Russell 2000 is similar. 
Additional data is in the Appendix. 


1.5.2 Ramadan 


Bialkowski, Etebari, and Wisniewski (2009) study stock returns during the Muslim 
holy period of Ramadan in 14 predominantly Muslim country during the period 
from 1989-2007. They find that stock returns during Ramadan are almost nine 
times higher (38.1% versus 4.3%) than during the rest of the year and that this 
conclusion persists after controlling for other known calendar anomalies like the 


Table 1.7: S&P500 Futures Average Returns, September and October, 1993-2010. 


1993 1994 1995 1996 1997 1998 1999 2000 2001 
Sep —0.87%  —2.17% 4.48% 6.09% 5.51% 6.86% 1.79% 4.58% 8.38 
Oct 1.87% 2.59% —0.77% 2.59% —3.78% 7.38% 5.73% —1.15% 1.46% 


2002 2003 2004 2005 2006 2007 2008 2009 2010 
Sep  -11.31% —1.43% 0.94% 1.03% 3.02% 4.07% —9.79% 3.16% 8.31% 
Oct 7.96% 5.51% 1.32% —2.08% 2.79% 1.00% —20.11% —2.07% 3.74% 


Average StDev t 
Sep 0.17% 0.058 0.128 
Oct 0.78% 0.061 0.536 
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January effect, Halloween effect, and Turn-of-the-Week effect. Their explanations for 
this phenomenon rely on recent behavioral theories that connect investor emotions 
with their decisions. Specifically, they suggest that these excess returns are a result 
of increased investor optimism experienced during Ramadan as a time of relative 
happiness, solidarity, and social identity for Muslims; they go as far as to suggest 
that Ramadan may cause mild states of euphoria, as suggested by Knerr and Pearl 
(2008). This upbeat or positive sentiment then causes relative overconfidence and 
an increased willingness to take risk, such that investors perceive investments as 
of relatively higher value. Bialkowski et al. also consider that during this relatively 
healthy period there may be a higher demand for equities, as documented by Rosen 
and Wu (2004). They, however, do not find any evidence of a higher trading volume 
during Ramadan. 

In their study, Bialkowski et al. discuss Ramadan to shed light on its potential 
emotional effects, review the clinical effects of fasting, and review empirical evidence 
on the effects of Ramadan on equity prices in 14 Islamic countries. Their empirical 
results compare the average returns during the holy month and the rest of the 
year and find that 11 of 14 countries studied have higher returns during Ramadan; 
the countries that did not exhibit this anomaly are Bahrain, Saudi Arabia, and 
Indonesia. They mention the effects of relatively few observations in the case of 
Bahrain and Saudi Arabia and note the outlier of the Asian Crisis in Indonesia 
effecting its equity prices during Ramadan. 

They further test their results by two event studies, benchmarking returns 
against a constant-mean-return model as well as a predictive market model using 
a proxy of 23 industrialized countries that do not have Muslim majorities. In these 
tests, they calculated cumulative abnormal returns (CAR) as returns in excess of 
what an investor should expect in the absence of Ramadan, finding this CAR to fall 
between 2.5%-3.1% depending on the event study approach. Among other robust- 
ness tests, Bialkowski et al. analyze if this CAR may be compensation for increased 
risk during Ramadan by examining return volatility, but do not find any supporting 
evidence and in fact find that, except Turkey, all countries studied actually showed 
decreased volatility during Ramadan. 

Bialkowski et al. also test their results for illiquidity effects, exchange rate 
considerations, and other accepted calendar effects, and find that the Ramadan 
Effect remains anomalous and can most likely explained by temporal changes in 
investor psychology. They conclude that investors can profit in Muslim stock mar- 
kets by buying shares at the beginning of Ramadan and selling them at its end, 
though they do not explicitly model this strategy nor do they estimate transaction 
costs. 


1.6 Day of the Week Effects 


Historically in U.S. markets, there have been differences in the daily mean returns 
across the days of the week. A common finding is high Friday returns and low 
Monday returns. Early papers showing this are Cross (1973), French (1980), 
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Gibbons and Hess (1981), Gultekin and Gultekin (1983), Lakonishok and Levi 
(1982), and Rogalski (1984). Lakonishok and Smidt (1988) relate the weekend effect 
to the TOM effect, see that section of this chapter. Harris (1986) investigated time 
of day effects. Stocks advanced near the close in all days including on the negative 
Mondays and positive Fridays. The Monday negative returns accrued over the week- 
end for large cap stocks but during the Monday trading session for small cap stocks. 
Stocks tend to advance in the first 45 minutes of trading on all days except Monday 
where they fall. Wang, Li and Erickson (1997) using 1962-1993 data show that 
the Monday declines are from last two Mondays of the month and that there is no 
monthly effect in the first three weeks. Kamara (1997) showed that the weekly effects 
declined in 1962-1993 because of increased institutional trading in large cap stocks. 
But the small cap effect remains. Futures minus spot S&P returns are reversed as 
traders anticipate the effect. Chen and Sinal (2003) argue that short sellers close 
positions on Friday which increases prices and the new Monday shorts lead to Mon- 
day losses. Chan, Leung and Wang (2004) argue that the Monday decline effect is 
largely due to individual not institutional investors. The Monday decline is strongest 
in stocks with low institutional holdings. Moreover, the mean return on Monday is 
the same as on the other four days of the week for stocks with high institutional 
holdings. 

Table 1.8 from French (1980) gives the mean daily returns and other character- 
istics for the US S&P500 index for 1953-1977 and five year sub-periods. One sees 
in this 25 year period negative Mondays and positive Fridays on average. 

International evidence was provided by Dubois and Louvet (1966), Jaffe and 
Westerfield (1985ab), Jaffe, Westerfield and Ma (1989). Steeley (2001) argued that 
the weekend effect in the U.K. disappeared in the 1990s. Moreover, the day-of-the- 
week effects are explained by news arrivals. 

Kato (1990), Kato, Schwartz and Ziemba (1989), Ziemba (1993) also study 
Japanese returns. Kato (1990) found negative Tuesdays not Mondays and posi- 
tive Wednesday and Saturday returns. Ziemba (1993) investigated weeks ending 
on Friday with a full trading session in two parts with a break or on Saturday 
with only the first session. He found that in the weeks with Friday endings, 
Fridays were positive and Mondays negative. But in the weeks with Saturday 
trading, Saturdays were higher positive and the negative day was the Tuesday. 
Somehow the two days were needed for the fall. Choudhry (2000), using January 
2000 to June 2005 data from Indonesia, Malaysia, the Philippines, South Korea, 
Taiwan and Thailand showed the presence of the day of the week effects in 
both stock returns and volatility and a possible spillover from the Japanese stock 
market. 

Table 1.9 from Chukwuogor-Ndu (2006) shows the mean return from Euro- 
pean stock markets from January 1997 to December 2004. Friday is positive in all 
15 countries studied and Mondays are mixed, some positive, some negative. 
Tuesdays and Thursdays are also positive in most countries. Standard deviations 
are high so statistical significance is weak. 
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Table 1.8: Means, Standard Deviations, and T-Statistics of the Percent Return From the Close 
of the Previous Trading Day to the Close of the Day Indicated. 


Monday Tuesday Wednesday Thursday Friday 


1953-1977 Mean —0 1681 00157 0 0967 0 0448 0 0873 


Standard deviation 0 8427 0 7267 0 7483 0 6857 0 6600 
t-statistic —6 823° 0 746 4 534° 2 283> 4 599° 
observations 1170 1 193 1 231 1 221 1 209 
1953-1957 Mean —0 2256 —0 0096 0 1592 0 0553 01413 
Standard deviation 0 8998 0 7498 0 7141 0 6751 06222 
t-statistic —3 851° —0 197 3 497° 1 287 3 533° 
observations 236 238 246 247 242 
1958-1962 Mean —0 1691 0 0537 0 0777 0 0652 0 1131 
Standard deviation 0 8512 0 7223 0 6503 0 6347 0 6097 
t-statistic —3 045° 1 149 1 885» 1 624 2 892° 
observations 235 239 249 250 243 
1963-1967 Mean —0 1389 0 0385 0 1008 0 0517 0 1015 
Standard deviation 0 5820 0 4991 0 5515 0 4933 0 4386 
t-statistic —3 650° 1 193 2 884° 1 660° 3 600° 
observations 234 238 249 251 242 
1968-1972 Mean —0 1673 —0 0058 0 1465 0 0003 0 1034 
Standard deviation 0 7769 0 6233 0 7425 0 6516 0 5898 
t-statistic —3 266° —0 144 3 005° 0 007» 2 705° 
observations 230 239 232 225 238 
1968-1972 Mean —0 1393 —0 0016 0 0057 0 0470 —0 0219 
Standard deviation 1 0379 0 9609 0 9968 0 9102 0 9304 
t-statistic —2 058» 0 026 0 091 0 813 —3 368 
observations 235 239 255 248 244 


aReturns for periods including a holiday are omitted. These returns are defined as Ry = 
ln(P;/P;—1)100. 

b5% significance level 

c0 5% significance level 

Source: French (1980) 


French (1980) using data from 1953 to 1977 on the S&P composite found that 
Friday returns had a positive mean as did Tuesday to Thursday but Monday had 
negative returns. This refuted two hypothesis: 


1. namely that returns occur continuously over time so Monday should have three 
times the average day mean time; and 

2. that all days have the same mean return since returns are generated during 
trading time which are the same for all days. 


Keim and Stambaugh (1984) extend the research back to 1928 that Mondays have 
negative mean returns for exchange traded stocks of all firm sizes and for OTC 
stocks. 

They also found that Friday to Monday correlations were higher than the other 
days. Abraham and Ikenberry (1994) find the same effect. When Friday is negative, 
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Table 1.9: Average Daily Returns of the EFM for the Period January 2, 
1997 — December 31, 2004. 


Country Mon Tue Wed Thur Fri 


Austria 0.0299 0.0233 0.0118  —0.0157 0.0037 
Belgium 0.0036 —0.3000 0.0011 0.3270 0.0134 
Czech Republic 0.0059 0.0154 0.0229 0.0242 0.0152 
Denmark 0.0057 0.0185 0.0213 0.0153 0.0240 
France —0.0120 0.0178 —0.0094 0.0278 0.0139 
Germany 0.0395 0.0253 —0.0196 0.0104 0.0125 
Italy —0.0137 0.0360 —0.0286 0.0280 0.0384 
Netherlands 0.0540 0.0179 -—0.0194 —0.0010 0.0252 
Russia 0.0428 0.1280 —0.0903 0.0721 0.1125 
Slovakia —0.1101 0.0371 0.0480 0.0313 0.0035 
Spain —0.0303 —0.7170 —0.0532 0.7840 0.0220 
Sweden 0.0551 0.0138 —0.0564 0.0177 0.0407 
Turkey —0.1730  —0.0329 0.0122 0.2069 0.2396 
Switzerland 0.0070 0.0132 0.0121 0.0245 0.0357 


United Kingdom —0.0181 0.0200 0.0095 0.0141 0.0313 
Source: Chukwuogor-Ndu (2006) 


Monday is negative 80% of the time with a —0.61% mean return. But when Friday 
was positive Monday is positive on average returning +0.11%. The effect like most 
anomalies is strongest in small and mid-cap companies. 

Lakonishok and Maberly (1990) show that there was an increase in individual 
as opposed to institutional trading who sell more than buy on Mondays. Connolly 
(1989) showed that the strength of the day of the week finding depend crucially on 
the estimation and testing methods used. Sullivan, Timmermann and White (2001) 
provide a strong econometric argument regarding the care needed to get proper 
statistical conclusions. Using 100 years of daily data, they find anomalies like the 
day-of-the-week and weekend effects are significant but if you consider data mining, 
the anomalies are not statistically significant over the whole universe of data. 

Summary: the cash evidence is strong that there was a weekend effect with 
positive Fridays and negative Mondays. But the strength of the effect has diminished 
and it is hard to prove that it is not data snooping and out of sample. Moreover, the 
effect seems to reverse in the futures markets. The reasons for the effect are many 
and varied. The question of can you implement strategies based on these results 
remains. Arsad and Coutts (1997) argue that liquidity and other transactions costs 
might limit this possibility. We now look at the recent data in the futures markets. 

To update, we compute the daily returns in the S&P500 and Russell 2000 futures 
from 1993-2011. Recall that with anticipation the futures might reverse the cash 
effect. Figure 1.18ab and Tables 1.10 give the results. In the futures markets for the 
S&P500, all days was positive with Thursday essentially zero Monday has higher 
average returns than all the days. All the days have gains slightly more than 50% of 
the time. The Russell 2000 is very different with higher returns on all the days except 
for Mondays which were negative. Still all days were positive slightly more than 50% 
of the time. Finally, Table 1.11 and Figures 1.19ab show the results by month. 
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Figure 1.18: Futures Daily Returns as Function of Day of the Week, 1993-2011. 


1.7 Option Expiry Effects in the Russell 2000 and S&P500 
Futures Markets 


Options expiry tends to be a positive period in the third week of the month; see 
Figure 1.20 and Table 1.12 for specific results. As usual, the fluctuations are stronger 
for the small cap Russell 2000 with a quite strong —3 day with a mean return of 
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Table 1.10: Futures Daily Returns as Function of Day of the Week, 


1993-2011. 

Mon Tue Wed Thu Fri All 
S&P500 
Count 917 979 978 962 956 4792 
Mean 0.0139 0.0130 0.0120 0.0124 0.0114 0.0126 
StDev 0.03% 0.06% 0.00% 0.00% 0.01% 0.02% 
t-stat 0.5823 1.4434 0.0776 0.1000 0.3509 1.1771 
Positive 54.7% 51.1% 54.1% 53.5% 52.0% 53.1% 
Russell 2000 
Count 923 978 978 967 957 4803 
Mean 0.0163 0.0153 0.0149 0.0151 0.0135 0.0150 
StDev —0.05% 0.07% 0.04% 0.02% 0.04% 0.02% 
t-stat —0.8561 1.4637 0.7715 0.3519 0.8436 1.1043 


Positive 51.2% 52.4% 55.2% 52.0% 54.6% 53.1% 


Table 1.11: Futures Daily Returns as Function of Day 
of the Week by Month, 1993-2011. 


Mon Tue Wed Thu Fri 


S&P500 

January 0.18 —0.12 0.13 —0.10 —0.14 
February 0.11 0.13 0.05 0.11 0.16 
March 0.01 0.10 0.03 0.17 0.08 
April —0.06 0.30 0.00 0.31 —0.15 
May 0.24 0.05 0.03 —0.14 —0.02 
June —0.09 —0.04 0.04 0.12 —0.04 
July 0.00 0.00 0.09 0.00 —0.07 
August —0.01 —0.14 0.20 —0.16 —0.06 
September —0.14 0.15 —0.11 —0.14 0.27 
October 0.14 0.17 —0.31 0.08 0.14 
November 0.08 0.03 —0.01 0.06 0.12 
December 0.05 0.18 0.13 —0.13 0.18 
Russell 2000 

January 0.08 —0.10 0.12 —0.06 —0.25 
February 0.10 —0.11 0.02 —0.10 —0.08 
March —0.11 0.10 0.07 0.27 —0.01 
April —0.34 0.32 0.13 0.42 —0.12 
May 0.26 0.15 —0.12 —0.07 0.05 
June —0.19 —0.06 0.15 0.16 0.10 
July —0.10 0.00 0.01 —0.15 —0.06 
August —0.08 —0.14 0.33 —0.15 0.01 
September —0.10 0.27 —0.10 —0.10 0.23 
October 0.09 —0.14 —0.30 0.04 0.15 
November 0.08 0.00 0.07 —0.03 0.08 


December —0.01 0.33 0.27 —0.08 0.38 


+0.61% on that day with 67.1% positive and a t-value of 3.62. The other days are 
basically noise. The S&P500 also has on average a gain of 0.40% on the —3 day and 
noise the other days around the options expiry. The ¢ for this day is 2.9 with 60.5% 
positive days. 
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Figure 1.19: Futures Daily Returns as Function of Day of the Week by Month, 1993-2011. 
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Figure 1.20: Futures Average Returns by Day Around Options Expirations, 1993-2011 (N = 1 
Options Expirations Day). 


1.8 Seasonality Calendars 


Ziemba and Schwartz (1991) made seasonality calendars for the first section of the 
Tokyo Stock Exchange (TSE) for the years 1988-1992. The days were ranked from 
—4, the worst, to +4, the best. A regression predicts the expected daily change of 
the index based on all the seasonal effects. So each trading day has both a ranking 
and an expected performance. Of course, this ranking and performance does not 
consider the current economic environment and recent news so it could be off. But 
it gives one an idea of what to expect and what to look for. 
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Table 1.12: Futures Average Returns by Day Around Options Expi- 
rations, 1993-2011. 


3 2 1 1 2 Others 
S&P500 
Count 76 76 76 76 76 4331 
St Dev 0.0121 0.0111 0.0112 0.0095 0.0127 0.0126 
Average 0.40% —0.03% 0.17% 0.01% —0.05% 0.01% 
Z 2.9003 —0.2086 1.3313 0.0551 —0.3184 0.7142 
Positive 60.5% 55.38% 63.2% 50.0% 46.1% 52.7% 
Russell 2000 
Count 76 76 76 76 76 4347 
St Dev 0.0148 0.0118 0.0126 0.0106 0.0166 0.0151 
Average 0.61% —0.05% 0.23% -—0.01% —0.15% 0.02% 
Z 3.6224 —0.3873 1.6119 —0.1234 —0.7912 0.7452 


Positive 67.1% 52.6% 56.6% 50.0% 39.5% 53.1% 


Canestrelli and Ziemba (2000) investigated seasonal anomalies in the Italian 
stock market from 1973-1993. The results show that the effects have been found in 
the U.S., Japan and other markets such as the weekend, turn-of-the-year, monthly, 
holiday and January barometer were present in Italy during that period. The data 
used were 7668 days from January 3, 1973 to December 31, 1993 of which 5238 
were trading days. The highest daily return was +8.03% and the lowest —10.02%. 
The return distribution was fatter in the tails than Gaussian normal and there was 
autocorrelation in these returns. They did a careful analysis of these anomalies and 
readers may refer to their paper in Keim and Ziemba (2000). They also ranked the 
days into seasonality calendars using the following regression: 


Rt= a1D Monday + a2 DTuesday + a3D Friday + a4DistDay T as D Easter 


+ as DChristmas = a7DistNov + agD Jan + ag D Day30 Te a10D pay31 + ut 


Monday Tuesday Friday 1st Day Easter Xmas Ist Nov Jan Day 30 Day 31 
Mean -0.2221 -0.2082 0.0705 1.2667 0.6832 0.3048 0.6672 0.1419 0.3150 0.2386 
(St Dev) (0.0442) (0.0431) (0.0439) (0.0809 (0.2945) (0.29801) (0.2820) (0.0653) (0.1003) (0.1355) 
t-stat -5.03** -4.83** 1.61 15.66** 2.32* 1.09 2.37* 2.17* 3.14** 1.76 


Cost = 0.00R2 = 0.0582 R2aaj = 0.0566D.W. = 1.58F = 32.44«*ReferringtoHo : a1 = a2 = a3 = =aio=0. 


* indicates 5% confidence level ** indicates 1% confidence level 


The results indicate that there was the following ranking of the anomalies by impor- 
tance of their effect on daily stock returns: 


First day of the monthly account 
Monday 

Tuesday 

Day 30 (calendar, not trading) 
15t November 

Easter 

January 


SVs oh te he 
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January 2010 Calendar and Returns 


Sunday Monday Tuesday Wednesday Thursday Friday Saturday 
Legend Code 1 New Year's Day 2 
RL code RL day score|TOM = —Turn-of-the-month 
RL realr RL expected r/RL = Russell 2000 future 
SP code SP day score|SP = S&P500 future 
SP realr SP expected r| 
3 4 5 6 th 8 9) 
Tom 0 0 0 0 o| 
2.15% 0.15%| -0.20% 0.15%| -0.02% -0.12%| 0.50% -0.12%| 0.64% -0.12% 
irom 1 1 o o 0 
1.62% 0.12%] 0.31% 0.12%) 0.07% -0.10%| 0.40% -0.10%| 0.35% -0.10% 
10 11 12 13 14 15 16 
o 0 o o o 
-0.50% -0.12%| -0.67% -0.12%| 0.82% -0.12%| 0.76% -0.12%| -1.42% -0.12% 
o 0 o o 0 
0.09% -0.10%] -0.74% -0,10%| 0.66% -0.10%] 0.33% -0.10%] -1.14% -0.10% 
17 18 ML King Day 19 20 21 22 23 
0 o o 0 
1.54% -0.12%| -1.30% -0.12%] -1.96% -0.12%| -1.47% -0.12% 
0 o o o 
1.19% -0.10%| -1.03% -0.10%] -2.03% -0.10%| -1.80% -0.10% 
24 25 26 27 28 29 30 
irom ifrom ifrom 1from ifrom 1 
-0.08% 0.17%] -0.81% 0.17%] 1.00% 0.17%] -1.85% 0.17%] -0.74% 0.17% 
[TOM 1}Tom 1|Tom 1}Tom 1jtom 1 
0.14% 0.12%] -0.48% 0.12%] 0.67% 0.12%| -1.39% 0.12%] -0.81% 0.12% 
31 
Jan 2010 RL return -3.79% 
Jan 2010 SP return -3.72% 


Figure 1.21: January 2010 Calendar and Results. 


8. Day 31 (calendar, not trading) 
9. Friday 
10. Christmas 


Dzhabarov and Ziemba made similar calendars for the S&P500 and Russell 2000 for 
2010. Figure 1.21 shows the seasonality calendar and results for the S&P500 and 
Russell 2000 futures for January 2010. The results of the exercise with January’s 
S&P500 = —3.72% and Russell 2000 = —3.79% are in Figure 1.21. When the authors 
created the calendar in 2009 they chose to use the coefficients from 2004—2009 rather 
than from 1993-2009 to employ the most current results. This way also offered better 
goodness of fit (higher R-squared). 

Each box for a given trading day shows the model expected returns on the 
right and real returns added after January 2010 on the left for each of two index 
futures plus a ranking from —4 the worst to —3, —2, —1,0, +1, +2, +3 and +4 the 
best (Russell 2000 first, S&P500 second). Obviously, economic news may dominate 
a particular day, but the calendars add value. To compute the calendars, regres- 
sions were run on all the days around an anomaly concept and then a streamlined 
equation with only the best predictors was used to construct the expected values 
and calendar weightings. The authors also used turn-of-the-year variable which con- 
sistently generated positive returns though not being statistically significant. The 
variables that are working: for Russell 2000 3rd day before holiday (positive), TOM 
good days (positive), turn-of-the-year good days (positive), 3° day before options 
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Table 1.13: Number of Days Ranked in Each Category for Each 
Month for the S&P500 and Russell 2000 in 2010. 


Russell 2000 S&P500 
Day Scores 0 1 2 3 4 0 1 2 3 4 
January 14 5 12 7 
February 11 7 1 11 7 1 
March 15 6 1 1 15 6 2 
April 15 6 15 6 
May 14 5 1 14 5 1 
June 14 6 1 1 14 6 2 
July 14 7 14 7 
August 15 7 15 7 
September 13 6 1 1 13 6 2 
October 14 7 14 7 
November 14 6 1 14 6 1 
December 17 3 1 1 12 7 1 2 
Total: 170 71 2 5 4 163 77 3 0 9 


expiry (positive) and for S&P500 3°¢ day before holiday (positive), turn-of-the- 
month good days (positive), turn-of-the-year good days (positive), 3°¢ day before 
options expiry (positive). 

How did the calendars do in 2010? We ran a test starting with initial wealth of 
$100 at the beginning of January. There was a $4 bet if the day’s rating was +4, 
$3 for +3, etc down to —$3 for —3 and —$4 for —4. Table 1.13 shows the number 
of trading days by rank for each month for the S&P500 and Russell 2000. Most of 
the days are ranked 0 or +1 with a few +2, 3, or 4. The lower two panels of this 
table are the number of days for each index that were actually positive and the 
percent positive. Figure 1.22 shows the graphs of the trading results and Table 1.14 
the monthly actual returns. Basically the calendars seem to beat the index when 
the index has low returns but underperform in strong market periods. 


1.9 Political Effects? 


1.9.1 When Congress is in session 


Ferguson and Witte (2006) find a strong correlation between Congressional activity 
and stock market returns such that returns are higher and volatility lower when 
Congress is in session. They use four data sets, including the Dow Jones Industrial 
Average since 1897, the S&P500 index since 1957, and the CRSP value-weighted 
index and CRSP equal-weighted index since 1962. They compare mean daily stock 
returns and annualized returns when the U.S. Congress is in and out of session. 
Depending on the index tested, statistically significant differences in average daily 
returns range from 4—11 basis points per day. Annualized stock returns are 3.3-6.5% 


3This is an updated and modified version of discussion in Ziemba (2012) 
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Figure 1.22: 2010 Calendar and Returns Trading. 


Table 1.14: 2010 Returns From Russell 2000 and S&P500 Buy 
and Hold Versus Calendar Returns. 


Buy & Hold Calendar 

Russell 2000 S&P500 Russell 2000 S&P500 
January —3.79% —3.72% —2.43% —1.68% 
February 4.35% 2.98% 5.03% 4.56% 
March 7.74% 5.57% 4.45% 2.68% 
April 5.50% 1.47% 0.20% 0.11% 
May —8.25% —8.41% 5.56% 3.21% 
June —8.54% —5.94% —2.84% —2.81% 
July 6.48% 6.80% —5.92% —1.72% 
August —7.67% —4.69% 1.45% 1.72% 
September 11.85% 8.31% 10.39% 6.28% 
October 4.00% 3.74% 1.02% 1.30% 
November 3.31% —0.11% —1.42% —3.26% 
December 7.62% 6.18% 2.75% 2.55% 


higher when Congress is out of session, and between 65-90% of capital gains have 
occurred when Congress is not in session (which is notably greater than the pro- 
portionate number of days Congress is not in session). 

Ferguson and Witte also test these results in several ways. First, they analyze 
if the Congressional effect is just a proxy for other known calendar effects, such as 
the Day-of-the-Week effect, January effect, and Pre-Holiday effect. They conclude 
that, after controlling for these anomalies, there is still a Congressional effect of 
3-6 basis points per day, which means that no more than half of the Congressional 
effect is captured by controlling for other known anomalies. The study also tests for 
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robustness and finds there is a low probability that these results are the effect of a 
spurious statistical relationship. 

Next they test if public opinion toward Congress accounts for the Congressional 
effect by using public polling data as a proxy for general investors attitudes toward 
Congress. They use 162 polls from 1939 to 2004, though 112 of these were conducted 
after 1989. They find that an active Congress does not itself lead to poor stock 
returns but rather that the publics opinion of that active Congress accounts for 
the depressed returns. They also find that each index exhibits volatility that is 
significantly lower when Congress is not in session and that this is also driven by 
public opinion. 

Then Ferguson and Witte test the implications of this predictive capability on 
optimal investor asset allocation using the models of Kandel and Stambaugh (1996) 
and Britten-Jones (1999); they find that trading on the Congressional effect would 
allow investors to better allocate between equities and cash and to achieve a higher 
Sharpe ratio. 

Ferguson et al. consider three alternatives as possible explanations of the Con- 
gressional Effect, concluding that their findings may be explained by viewing public 
opinion of Congress as a proxy for investors moods, regulatory uncertainty, or rent- 
seeking. The mood-based hypothesis follows other studies in behavioral finance that 
suggest depressed investors are relatively risk averse, which in this case would imply 
that negative public opinion of Congress was depressing investors and dampening 
returns. The regulatory uncertainty hypothesis follows from the implication that 
there is more uncertainty in the market when Congress is in session, such that risk 
and therefore returns are higher. The rent-seeking hypothesis is based on Rajan 
and Zingales (2003) and suggests that concentrated economic interests limit the 
efficiency of markets such that they are less efficient and biased toward powerful 
financial players when Congress is in session. 


1.9.2 Election cycles 


Herbst and Slinkman (1984), using data from 1926-1977, found a 48-month 
political/economic cycle during which returns were higher than average; this cycle 
peaked in November of presidential election years. Riley and Luksetich (1980) and 
Hobbs and Riley (1984) showed that, from 1900-1980, positive short-term effects 
followed Republican victories and negative returns followed Democratic wins. Huang 
(1985), using data from 1832-1979 and for various subperiods, found higher stock 
returns in the last two years of political terms than in the first two. This finding 
is consistent with the hypothesis that political reelection campaigns create poli- 
cies that stimulate the economy and are positive for stock returns. These studies 
concerned large cap stocks. 

Hensel and Ziemba (1995c, 2000) investigated several questions concerning US 
stock, bond and cash returns from 1928 to 1997. They asked: Do small and large 
capitalization stock returns differ between Democratic and Republican adminis- 
trations? Do corporate bond, intermediate and long-term government bonds and 
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Treasury bill returns differ between the two administrations? Do the returns of var- 
ious assets in the second half of each four-year administration differ from those in 
the first half? Were Clinton’s administrations analogous to past Democratic admin- 
istrations? We also discuss here the terms of George W. Bush and Barack Obama 
to update to the end of 2011. 

Their results indicate a significant small cap effect during Democratic presiden- 
cies. Small cap stocks (the bottom 20% by capitalization) had higher returns dur- 
ing Democratic than Republican administrations. There has also been a small cap 
minus large cap S&P advantage outside the month of January for the Democrats. 
The higher returns with Democrats for small cap stocks are the result of gains 
rather than losses in the April-December period. The TOY small firm effect, in 
which small cap stock returns significantly exceed those for large cap stocks in 
January, under both Republican and Democratic administrations, occurred during 
these 70 years. This advantage was slightly higher for Democrats, but the difference 
is not significant. Large cap stocks had statistically identical returns under both 
administrations. For both Democratic and Republican administrations, small and 
large cap stock returns were significantly higher during the last two years of the 
presidential term than during the first two years. Moreover, bond and cash returns 
were significantly higher during Republican compared with Democratic administra- 
tions. The results also confirm and extend previous findings that equity returns have 
been higher in the second half compared with the first half of presidential terms. 
This finding is documented for small and large cap stocks during both Democratic 
and Republican administrations. Finally, two simple investment strategies based 
on these findings yielded superior portfolio performance compared with common 
alternatives during the sample period. The results cast doubt on the long run wis- 
dom of the common 60/40 stock-bond strategy since all 100% equity strategies 
investigated had much higher wealth at the end of the sample period. Indeed the 
1942-1997 returns were twenty-four times higher with the strategy small caps with 
Democrats and large caps with Republicans than the 60/40 mix and the updated 
1998-2010 returns shown in Table 1.20 show similar outperformance. 

Table 1.15 shows that both small and large cap stocks had lower mean returns in 
the 13 months following an election. Figure 1.24 shows the specific months following 
the election for large (S&P500) and small cap (bottom 20%) stocks. 


Table 1.15: Annual Average Equity Returns for Presidential Election Months 
and the Subsequent 13 Months, 1928-1997, Minus Annualized Monthly 
Averages.* 


1928-1997 1998-2010 1928-2010 
Return Period Large Small Large Small Large Small 
Election + Next 13 Months 8.12 6.51 4.08 12.20 7.54 7.33 
Annual Average 10.12 12.02 5.19 8.22 9.34 11.42 


Annual Difference —2.00 —5.51 —1.11 4.00 —1.79 —4.09 


*Monthly means were annualized by multiplying by 12 
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The 1928-1997 period encompassed 18 presidential elections with an update to 
2010 and three more elections. The end of 1997 included the first year of Clinton’s 
second term. There were 33 years of Republican and 37 years of Democratic admin- 
istrations during this period. The update to the end of 2010 covers the last three 
years of Clinton’s second term plus two George W. Bush terms plus the first two 
years of Barack Obama’s administration, namely 1998 to 2010, a period where small 
cap stocks outperformed large cap stocks. Tables 1.16 and 1.17 list and compare 
the first year, first two year, last two year and whole term mean returns under 
Democratic and Republican administrations from January 1929 to December 1997 
and for January 1937 to December 1997, a period that excludes one term for each 
party during the 1929 crash, subsequent depression and recovery period plus the 
update to 2010. Each term is considered separately, so two-term presidents have 
double entries. The t-values shown in Table 1.16 test the hypothesis that, during 
the 1928-1997 period, returns did not differ between Democratic and Republican 
administrations. 

From 1929 to 1997, the mean returns for small stocks were statistically higher 
during the Democratic presidential terms than during the Republican terms. The 
data confirm the advantage of small cap over large cap stocks under Demo- 
cratic administrations. Small cap stocks returned, on average, 20.15% a year under 
Democrats compared with 1.94% under Republicans for the 1929-1997 period. This 
difference, 18.21%, was highly significant. The first year return differences for this 
period were even higher, averaging 33.51%. 

The right hand panel of Table 1.16 presents the return results after eliminating 
the 1929 crash, the Depression and the subsequent period of stock price volatil- 
ity. Removing these eight years (1929-1936) from the study eliminates one Demo- 
cratic and one Republican administration from the data. The small stock advantage 
under Democrats was still large (an average of 7.55% per four-year-term) but was 
no longer statistically significant. The large cap (S&P500) returns during Demo- 
cratic rule were statistically indistinguishable from the returns under Republican 
administrations. Table 1.17 has the update to 2010. 

For Democratic and Republican administrations, the mean small and large cap 
stock returns were much higher in the last two years compared with the first two 
years of presidential terms for both of the time periods presented in Table 1.16. For 
example, small cap stocks returned 24.65% during the last two years compared with 
15.90% during the first two years for Democrats and 10.18% compared with —6.29% 
for Republicans from 1929 to 1992. Returns on large cap stocks increased to 17.40 
from 8.09% for Democrats and to 9.06% from 3.77% for Republicans for the same 
period. This result is consistent with the hypothesis that incumbents embark on 
favorable economic policies in the last two years of their administrations to increase 
their reelection chances and that the financial markets view these policies favorably. 

The advantage of small stocks over large stocks under Democratic administra- 
tions was not a manifestation of the January small stock effect. Instead, Tables 1.23 
and 1.18 and Figure 1.23 show the relative advantage of small over large cap stocks 
under Democrats compared with that under Republicans was attributable to having 
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Table 1.16: Average Annual Returns for the First and Second Years and Four Years of Demo- 
cratic and Republican Presidencies* to 1997. 


January 1937 to December 1997 January 1929 to December 1997 
S&P500 TR US Small Stk TR S&P500 TR US Small Stk TR 


Democrat 
Avg 1st Yr 6.58 11.32 10.24 19.06 
Avg 1st 2Yrs 6.14 11.85 8.09 15.90 
Avg Last 2Yrs 16.13 24.11 17.40 24.65 
Avg. Term 10.81 16.71 12.62 20.15 
StdDev Term 16.35 27.76 18.26 30.69 
# Years 36.00 36.00 37.00 37.00 
Republican 
Avg 1st Yr 1.87 —6.22 0.54 —14.45 
Avg 1st 2Yrs 6.98 1.39 3.77 —6.29 
Avg Last 2Yrs 15.03 16.95 9.06 10.18 
Avg. Term 11.00 9.17 6.42 1.94 
StdDev Term 15.12 19.89 21.17 27.81 
# Years 28.0 28.0 32.0 32.0 
Diff Ist Yr 4.72 17.54 9.71 33.51 
Diff 1st 2Yrs —0.84 10.46 4.32 22.19 
Diff Last 2Yrs 1.10 7.16 8.33 14.47 
Diff Term —0.19 7.55 6.20 18.21 
lst year t-values 0.67 1.39 1.15 2.58 
(Ho:Diff=0) 
First 2-years t-values —0.14 1.13 0.69 2.39 
(Ho:Diff=0) 
Last 2-years t-values 0.20 0.69 1.20 1.41 
(Ho:Diff=0) 
Term t-values —0.05 1.04 1.29 2.57 
(Ho:Diff=0) 


*In this and subsequent tables, statistically significant differences at the 5% level (2-tail) are 
shown in bold. 
Source: Hensel and Ziemba (2000) 


fewer small stock losses, as well as higher mean small stock returns, in the April- 
December period. Under Democrats, the mean returns were positive in each of these 
months, except October, and the small minus large differential was positive during 
10 of the 12 months; under Republicans, the small minus large differential was 
negative during 9 of the 12 months. 

The small cap advantage also occurred in the months following Democrat 
Clinton’s first election. From November 1992 to December 1993 the small cap index 
rose 36.9% versus 14.9% for the S&P500. This domination continued until the sec- 
ond election. Small caps returned 1.58% per month versus 1.31% per month for the 
S&P500 from November 1992 to October 1996. However, large cap S&P500 returns 
began exceeding small cap returns in 1994 and this continued through 1997. The 
January 1994 to December 1996 returns were small cap 1.36% per month versus 
1.50% per month for the S&P500. From November 1996 to December 1997 small 
caps returned 1.81% per month and the S&P500 2.44% per month. There was a 
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Table 1.17: Average Annual Returns for the First Year and Four Years of Democratic and Repub- 
lican Presidencies* to 1997 Update to 2010. 


January 1937— January 1929- January 1998- 
December 2010 December 2010 December 2010 


S&P500 Small Cap S&P500 Small Cap S&P500 Small Cap 


Democrat 

Avg 1% yr 8.79 13.08 11.86 19.87 26.46 27.19 
Avg 1% 2 yrs 8.74 12.65 10.19 16.07 23.49 17.16 
Avg last 2 yrs 14.10 21.11 15.32 21.83 5.97 9.12 
Avg Term 11.56 16.35 13.08 17.75 16.48 13.94 
Std Dev Term 16.21 26.79 17.95 29.71 15.19 15.46 
# years 38 38 42 42 5 5 
Republican 

Avg 1% yr 0.68 —4.05 —0.19 —11.18 —3.49 3.52 
Avg 1% 2 yrs 4.69 1.35 2.48 —4.92 —3.32 1.23 
Avg last 2 yrs 12.14 14.86 7.78 9.70 2.02 7.56 
Avg Term 8.41 8.11 5.01 2.43 —0.65 4.39 
Std Dev Term 16.96 21.21 21.24 27.28 21.51 24.90 
Avg 15¢ yr 36 36 40 40 8 8 

# years 36 36 40 40 8 8 


phenomenal growth in S&P500 index funds and much foreign investment in large 
cap stocks during this period. While small caps had very large returns, those of the 
S&P500 were even higher. 

We investigated how inflation varied with political regimes. The results for 
the 1929-1997 period, using the Ibbotson inflation index, indicate that inflation 
was significantly higher under Democrats, but this difference was contained in the 
1929-1936 period. Excluding this early period, inflation was slightly higher, on aver- 
age, under Democrats but not statistically different from inflation under Republican. 
Inflation rates differed across the years of the presidential terms. For example, for 
the 1937-1997 period, in the first year of the presidential term, inflation under the 
Democrats was significantly lower than it was under the Republicans. An analysis of 
the first and second two years of administrations during this same period indicated 
that inflation was higher under Democrats but the difference was not statistically 
significant. 


US bond returns after presidential elections 


The bond data are also from Ibbotson Associates and consist of monthly, continu- 
ously compounded total returns for long term corporate bonds, long term (20-year) 
government bonds, intermediate (5-year) government bonds, and cash (90-day 
T-bills). 

Figure 1.24(a) illustrates average return differences for bonds during election 
months and the subsequent 13 months (1929-1997) minus each months 1928-1997 
average return. Figure 1.24(b) updates this to 2010. Corporate, long term govern- 
ment, and intermediate government bond returns were all higher than the monthly 
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Table 1.18: Average Monthly Small- and Large-Cap Stock Returns During Democratic and 
Republican Presidencies, January 1929 — December 2011. 


January 1929 to December 2011 


Democratic Republican Dem diff. 
S&P500 small-cap small-S&P S&P500 small-cap small-S&P Rep diff 
January 1.33 5.35 4.02 1.25 4.88 3.63 0.38 
February —0.37 0.92 1.29 0.92 1.86 0.94 0.36 
March 0.16 —0.43 —0.59 0.69 1.19 0.50 —1.08 
April 2.30 2.81 0.51 0.24 —1.12 —1.36 1.87 
May 0.75 0.91 0.16 —0.11 —0.72 —0.61 0.77 
June 1.49 1.60 0.11 0.16 —0.32 —0.49 0.60 
July 1.85 2.45 0.60 1.10 0.36 —0.74 1.34 
August 0.73 0.88 0.15 1.41 1.19 —0.22 0.38 
September 0.43 0.92 0.49 —2.81 —3.22 —0.41 0.90 
October 0.90 0.04 —0.87 —0.21 —2.02 —1.81 0.94 
November 1.39 1.56 0.17 0.67 0.01 —0.66 0.84 
December 1.81 2.24 0.43 1.45 0.21 —1.24 1.67 
January 1929 to December 1997 (Hensel and Ziemba, 2000) 
ga Poe O ea o EO O a eni di 
S&P500 small-cap small-S&P S&P500 small-cap smal-S&P Rep diff 
January 1.72 6.45 4.73 1.65 5.93 4.28 0.45 
February —0.38 0.74 1.12 1.59 2.78 1.19 —0.07 
March —0.58 —0.91 —0.33 0.96 1.21 0.25 —0.58 
April 2.25 2.58 0.33 —0.24 —1.82 —1.58 1.91 
May 1.07 1.40 0.33 —0.50 —1.52 —1.02 1.35 
June 1.57 1.71 0.14 0.78 —0.40 —1.18 1.32 
July 1.95 2.81 0.86 1.69 1.11 —0.58 1.44 
August 1.17 1.65 0.48 1.73 1.25 —0.48 0.96 
September 0.40 0.78 0.38 —2.87 —3.31 —0.44 0.82 
October 0.42 —0.24 —0.66 —0.40 —2.66 —2.26 1.60 
November 1.44 1.61 0.17 0.44 —0.53 —0.97 1.14 


December 1.56 1.58 0.02 1.59 —0.09 —1.68 1.70 


Jan 1998 to Dec 2011 


sae ae SNE E eta 

S&P500 small-cap small-S&P S&P500 small-cap small-S&P Rep diff 
January —1.57 —2.82 —1.25 —0.37 0.67 1.05 —2.30 
February —0.32 2.27 2.59 —1.75 —1.81 —0.07 2.65 
March 5.62 3.13 —2.50 —0.37 1.11 1.48 —3.97 
April 2.66 4.54 1.88 2.14 1.67 —0.47 2.35 
May —1.61 —2.70 —1.09 1.46 2.50 1.04 —2.13 
June 0.90 0.81 —0.09 —2.31 —0.02 2.29 —2.37 
July 1.13 —0.20 —1.33 —1.24 —2.63 —1.39 0.06 
August —2.51 —4.79 —2.27 0.15 0.95 0.80 —3.07 
September 0.67 1.99 1.32 —2.59 —2.86 —0.28 1.59 
October 4.48 2.08 —2.40 0.54 0.52 —0.02 —2.38 
November 0.99 1.20 0.21 1.60 2.17 0.57 —0.36 


December 3.63 7.13 3.50 0.88 1.40 0.52 2.98 
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Figure 1.23: Cap Size Effects and Presidential Party, Democratic (Small Minus Large) Minus 
Republican (Small Minus Large). 


average in the year following an election only in May, October and November in the 
1928-1997 period. Both government bonds also exceeded the average in some other 
months. The update only has three elections and the monthly pattern is different 
than it was in the past. 

As Table 1.19 indicates, the performance of fixed income investments differed 
significantly between Democratic and Republican administrations. All fixed income 
and cash returns were significantly higher during Republican than during Demo- 
cratic administrations during the two study periods. The high significance of the 
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Figure 1.24: Stock Monthly Return Differences: Presidential Election Months and the Subsequent 
13 Months Minus Monthly Averages. 
Source: Hensel and Ziember (2000b) 


cash difference stems from the low standard deviation over terms. The performance 
of fixed income investments differed very little between the first two years and the 
last two years of presidential terms. 

The distribution of Democratic and Republican administrations during the 
1929-1997 period played a part in the significance of the fixed income and cash 
returns. As Table 1.19 indicates, the cash returns for the first four Democratic 
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Table 1.19: Annualized Average Monthly Return.* 


Term President Party Bonds Cash Bonds-Cash 
1929-1932 Hoover Republican 4.61 2.26 2.35 
1932-1936 Roosevelt Democratic 5.05 0.20 4.85 
1937-1940 Roosevelt Democratic 3.73 0.08 3.65 
1941-1944 Roosevelt Democratic 1.74 0.25 1.49 
1945-1948 Roosevelt /Truman Democratic 1.48 0.50 0.98 
1949-1952 Truman Democratic 1.24 1.35 —0.11 
1953-1956 Eisenhower Republican 1.19 1.66 —0.47 
1957-1960 Eisenhower Republican 4.24 2.54 1.70 
1961-1964 Kennedy/Johnson Democratic 3.21 2.84 0.37 
1965-1968 Johnson Democratic 2.76 4.43 —1.67 
1969-1972 Nixon Republican 7.06 5.19 1.87 
1973-1976 Nixon/Ford Republican 7.42 6.25 1.17 
1977-1980 Carter Democratic 3.17 8.11 —4.94 
1981-1984 Reagan Republican 13.71 10.39 3.32 
1985-1988 Reagan Republican 10.35 6.22 4.13 
1989-1992 Bush Republican 10.77 6.11 4.66 
1993-1996 Clinton Democratic 5.74 4.30 1.44 
1997—2000 Clinton Democratic 5.77 5.07 0.69 
2001-2004 Bush Republican 3.69 1.84 1.85 
2005-2008 Bush Republican 4.00 3.40 0.61 
2009-2011 Obama Democratic 2.82 0.17 2.65 


Source: updated from Hensel and Ziemba (2000) 

*From 1998-2011 we used the 3-month T-bill secondary market rate discount basis for cash and 
market yield and U.S. Treasury securities at 5-year constant maturity, quoted on investment basis 
for bonds. 


administrations in this period (1933-1948) were very low (0.20%, 0.08%, 0.25% and 
0.50% annually). This result largely explains why the term cash-return differences 
are so significant (t-value = —12.31 for 1929-1997). Democratic administrations were 
in power for three of the four terms during the 1941-1956 period, when government 
bonds had low returns. Bond returns in the 1961-1968 period (both Democratic 
terms) and 1977-1980 period (Democratic) were also low. 


Some simple presidential investment strategies 


Two presidential party based investment strategies suggest themselves. The first is 
equity only and invests in small caps with Democrats and large caps with Repub- 
licans; the second, a simple alternating stock-bond investment strategy, invests 
in small cap stocks during Democratic administrations and intermediate govern- 
ment bonds during Republican administrations. The test period was January 1937 
through December 1997 with an update from 1998 to 2010. 

The common 60/40 (large cap/bonds) portfolio investment strategy and provides 
a benchmark for comparison with the two strategies. Transaction costs were not 
included, but they would have a minor effect on the results because the higher return 
presidential strategies trade at most every four years. These investment strategies 
all lost money until the early 1940s, see Table 1.20 which shows the cumulative 
wealth. 
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Table 1.20: Value of $1 Initial Investment in 1997 and 2010. 


Large Cap Small Cap Presidential Presidential 60/40 
Date (S&P) (SC/Int) (SC/LC) Benchmark 
Jan 1937—Dec 1997 346.1 453.2 527.9 963.2 140.5 
Jan 1942—Dec 1997 639.0 2044.1 2380.9 4343.8 180.9 
Jan 1937—Dec 2011 574.7 919.5 1255.9 1349.2 247.0 


Jan 1942—Dec 2011 1061.1 4147.1 5664.2 6084.6 318.0 


Source: Hensel and Ziemba (2000), updated 
*Russell 2000 Index TR after December 1997 


The two presidential investment strategies performed well over the sample 
period. The strategy of investing in small cap stocks during Democratic admin- 
istrations and large cap stocks during Republican administrations produced greater 
cumulative wealth than other investment strategies. The alternating stock-bond 
strategy of investing in small cap stocks under Democrats and intermediate bonds 
under Republicans produced the second highest cumulative wealth. Both of these 
presidential party based strategies had higher standard deviations than large cap 
stocks alone during the 1937-1997 period. Clinton’s first administration had returns 
for small and large cap stocks, bonds, and cash consistent with the past. However, 
in the first fourteen months of his second administration large cap stocks produced 
higher returns than small cap stocks. 

In the update in Table 1.20 we see that, for the 1942-2011 period, small cap 
stocks (Russell 2000 from 1998) produced about four times the gains of large cap 
S&P500 stocks (4147.1 versus 1361.1). But the small cap with Democrats and large 
cap with Republicans was even higher at 6084.6. Meanwhile, the 60/40 portfolio 
was at 318.0 about an as much!. 

Table 1.21 displays the mean returns and standard deviations for the various 
subperiods for the various strategies. 


Remarks 


An interesting finding of this study was the much higher small-stock returns dur- 
ing Democratic administrations as compared with Republican administrations. This 
finding is consistent with the hypothesis that Democrats devise economic policies 
that favor small companies and consequently, their stock prices. The 33.51% differ- 
ence between small stock performance in Democratic and Republican administra- 
tions in the first year in office and the 18.21% difference for the full four-year term 
are very large. In 2011 up to the end of July, the Russell 2000 small cap index has 
moved above its 2007 high but the S&P500 has not. So the small cap advantage 
with a Democratic president is continuing. 

This political party effect is different from the well-known January small 
firm effect which has been present for Republicans as well as Democrats. We 
found in addition a substantial small stock/large stock differential outside of 
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Table 1.21: Average Returns and Standard Deviations for Different Investment Strategies for 
Different Investment Horizons. 


Strategies 
4 Large-Cap Small-Cap Pres (SC/Int) Pres (SC/LC) 60-40 
Dates years Mean StDev Mean StDev Mean StDev Mean StDev Mean StDev 
Jan 1937—Dec 1997 61 10.9 15.8 13.3 24.5 12.6 20.7 14.1 22.8 86 10 
Jan 1938-Dec 1997 60 11.8 15.5 14.9 23.7 14.3 19.8 15.8 22 9.2 9.8 
Jan 1948-Dec 1997 50 12.3 14 13.9 19.2 13.1 12.7 14.9 16.5 97 9 
Jan 1958—Dec 1997 40 11.6 14.2 14.7 20.2 14.6 13 15.6 17 9.7 9.3 


Jan 1968—Dec 1997 30 11.4 15.1 12.7 21.5 143 12.4 143 17.5 10.1 9.9 
Jan 1978—Dec 1997 20 15.4 14.7 163 195 159 13.8 17.2 17.9 12.9 9.9 
Jan 1988—Dec 1997 10 16.6 11.9 15.2 149 13.7 99 16.2 13.2 13.2 8.2 
Jan 1993—Dec 1997 5 18.4 106 17.7 13.3 17.7 13.3 17.7 13.3 13.5 7.4 
Jan 1995—Dec 1997 3 27.1 11.2 22.1 15.1 22.1 15.1 22.1 15.1 19.7 7.6 


Jan 1993—Dec 1993 1 9.5 6.1 19 9.4 19 9.4 19 9.4 10 4.3 
Jan 1993—Dec 1994 2 5.4 8.5 11.1 9.8 11.1 9.8 11.1 9.8 4.3 6.5 
Jan 1995—Dec 1996 2 26.3 8.4 22.9 14.2 22.9 14.2 22.9 14.2 19.3 5.9 
Jan 1997—Dec 1997 1 28.8 15.8 20.5 17.6 205 17.6 205 17.6 20.5 10.5 


Jan 1937—Dec 2010 74 8.9 16 12 23.5 11.9 20.1 12.5 22 7.9 11.7 
Jan 1938—Dec 2010 73 9.6 15.7 13.3 22.8 13.2 19.3 13.9 21.2 8.4 11.6 
Jan 1948—Dec 2010 63 9.7 146 12.3 18.7 12.1 13.7 12.8 16.5 8.7 11.5 
Jan 1958—Dec 2010 43 10.7 14.9 15.5 19.5 16.1 141 16 16.9 10.4 12 

Jan 1968—Dec 2010 33 10.2 15.6 14 20.3 16.3 14 149 17.3 11 12.7 
Jan 1978—Dec 2010 23 13.2 15.5 17.7 18.5 185 15.1 17.7 17.4 13.9 13.4 
Jan 1988-—Dec 2010 13 12.4 15 17.8 16 18.9 145 17.3 154 149 14.3 
Jan 2003—Dec 2010 8 7.7 15.2 12.4 20.5 9.7 13.3 9.3 17.4 5.9 9.1 
Jan 2008—Dec 2010 3 —0.3 16.7 6.2 21.9 8.3 15.7 4.9 19.7 4.7 10 


Source: Hensel and Ziemba (2000) and update to 2010 


January during Democratic rule (see Table 3). Large stock returns were sta- 
tistically indistinguishable between Democrats and Republicans, but bond and 
cash returns were significantly higher during Republican than during Demo- 
cratic administrations. We also confirmed and updated Huang’s finding that large 
cap stocks have had higher returns in the last two years of presidential terms; 
this finding applies regardless of political party and for both small and large 
cap stocks. 

A study of the differences in economic policies that lead to the divergence of 
investment results according to which political party is in office would be interesting. 
Clearly, candidates seeking reelection are likely to favor economic policies that are 
particularly attractive to the public; and those policies are consistent with higher 
stock prices. Cash returns did not differ significantly between the first and second 
two-year periods of Democratic and Republican presidential terms. 


1.9.3 Election cycles: Other literature 


Stoval (1992) and Hensel and Ziemba (1995, 2000) documented the Presidential 
Election Cycle effect, which exhibited that stock markets generally had low returns 
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during the first two years after an US Presidential election and high returns during 
the last two years. Other subsequent studies have documented the economically and 
statistically significant difference in equity returns during the first and second half 
of Presidential terms for Republican and Democratic administrations. Some studies 
use more detailed models. Wong and McAleer (2008) examine the cyclical effect that 
Presidential elections have on equity markets using a spectral analysis technique 
and an exponential GARCH Intervention model to correct for time-dependence 
and heteroskedasticity. They consider the period from January 1965 to December 
2003 using weekly data with dummy variables to designate the year of the term and 
the President’s party. 

Wong and McAleer find a cyclical trend that mirrors the four-year election 
cycle with a modified cycle of between 40-53 months. They find that stock prices 
generally fall until a low-point during the second-year of a Presidency and then 
rise during the remainder, peaking in the third or fourth year. During the current 
Obama Democratic administration, the low was in March 2009 in his first year and 
the market has doubled since then to the end of March 2011. Wong and McAleer 
also find this Presidential Election Cycle effect to be notably more significant under 
Republican administrations, leading them to posit that the Republican Party may 
engage in policy manipulation in order to benefit during elections relatively more 
than their Democratic counterparts. For instance, the second-year and third-year 
effect estimates are not significant for Democratic administrations. 

Wong and McAleer explain the Presidential Election Cycle as follows. During 
the first year of a Presidency, voters are on average optimistic, and Presidents are 
likely to put their most divergent and expensive new policies in place, because they 
have the mandate of the voters and re-election time is furthest away. These early 
measures are relatively disadvantageous to business profits and stock prices because 
they usually involve higher taxes and spending and possibly new regulations. Then, 
during the second year of a term, Presidents begin to alter their policies to ones 
that are less drastic and more voter-friendly. 

The Presidential Election Cycle effect persists when looked at by President and 
by party. For instance, the only two Presidents who did not exhibit the cycle effect 
were Ronald Reagan and Bill Clinton during their second terms, during which they 
would not have re-election incentives like first-term Presidents. Empirical results 
also find that Republicans who were subsequently re-elected had a positive effect 
during the second-year of their term instead of the negative effect expected by the 
Presidential Election Cycle hypothesis. This suggests these Republicans may have 
used government policies to their favor to win re-election and should be useful for 
incumbent Presidents to consider in their electoral strategy. This last conclusion, 
however, does not follow from the conflicting observation that bull markets have 
tended to coincide with sub-periods under Democratic administrations. Wong and 
McAleer conclude that this anomaly was present during most of the last forty years 
and is likely still present in the market. 
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1.10 Turn-of-the-month Effects 


Historically there have been high returns for both large and small cap stocks around 
the turn-of-the-month (TOM). Market advisors such as Merrill (1966), Fosback 
(1976) and Hisrch (1986) have argued that stocks advance at the TOM. Ariel (1987) 
documented this for the U.S. using equally and value weighted indices of all NYSE 
stocks from 1963-1981; see Figure 1.25a, b. 
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Figure 1.25: The U.S. Turn of the Month Effect, Mean Daily Percent Returns on Trading Days 
—9 to +9, 1963-1981. 


Source: Ariel (1987) 
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The five days —1 to +4 historically were the TOM and had a large amount of 
the monthly gains. Indeed that period and the second week actually had all the 
monthly gains. Ariel (1987) found the following portfolio gains 


Equally weighted Value weighted 


First half of trading month 2552.40% 565.40% 
Last half of trading month —0.25% —33.80% 


Nineteen years 2545.90% 339.90% 


Lakonishok and Smidt (1988) found that during the ninety year period, 1897-1986, 
the large capitalized Dow Jones industrials rose 0.475% during the four day period 
—1 to +3 each month whereas the average gain for a four day period was 0.061% 
with an increase in prices over 56% of the time. The average gain per month over 
these 90 years was 0.349%. Hence, aside from these four days at the turn of the 
month, the DJIA actually fell. 

The effect has continued in recent years even in the presence of index futures 
contracts which began trading in the U.S. in 1982. Hensel, Sick and Ziemba (1994) 
found for the period May 1982 to April 1992 using the S&P500 large cap and Value 
Line small cap indices, consistent with the previous evidence that about two thirds 
of the months gains occur on trading days —1 to +4, the turn of the month, and the 
rest of the months gains occur on trading days +5 to +9 so that all or more than 
all of the gains occurs in the first half of the month. The second half was at best 
noise. The effect was monthly dependent with the largest gains in January and size 
dependent with the small capitalized value line index of about 1650 stocks having 
higher means and lower standard deviations than the large capitalized S&P500 
index. There was partial anticipation in the futures market as shown in Figure 1.26. 
For the small capitalized value line index, the cash effect on day —1 was partially 
anticipated on days —4 to —2. Then the effect in the cash market on days +2 and 
+3 was partially anticipated on day +1. Hence, the cash market effect on days 
—1 to +4 was as Ariel found for the 1963 to 1981 data with small gains on days 
—4 to —2. For the large capitalized S&P500 index, the results were similar except 
that there are higher returns in the cash market in the anticipation period (—4 to 
—2) and lower returns in the —1 to +4 period. 

The reasons for the turn of the month effect are several but they are largely 
cash flow and institutionally based. See Hensel, Sick and Ziemba (1996) and Gon- 
zalez (2006) who discusses that paper. The U.S. economy uses a system where 
much money is paid on the —1 day such as salaries and bills and debt payments. 
In addition to some of this money being invested in the stock market, there are 
institutional corporate and pension fund purchases at that time. These cash flows 
vary by month and lead to higher average returns in January which has the highest 
cash inflow. Ogden (1990) presents some empirical support of this hypothesis and 
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Figure 1.26: Mean Percentage Daily Returns in the Cash and Futures Market for Small and Large 
Capitalized Stocks by Trading Day of the Month, May 1982 — April 1992. 


Source: Hensel, Sick and Ziemba (1994) 


related monetary actions for U.S. markets. Another factor in this effect seems to 
be behavioral. One manifestation is that bad news such as that relating to earnings 
announcements is delayed and announced late in the month while good news is 
released at the beginning of the month; see Penman (1987). Bouges, Jain and Puri 
(2009) show a turn of the month effect in the S&P ADR. 
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The TOM was similar in Japan except that the dates change with the turn being 
days —5 to +2, with +3 to +7 being the rest of the first half of the month. Ziemba 
(1989, 1991) investigated this. The reasons for the effect in Japan seem to be: 


e Most salaries were paid during the period of the 20-25 day of the month, with 
the 25t! being especially popular. 
There was portfolio window dressing on day —1. 


Security firms could invest for their own accounts on amounts based on their 
capitalization. Since their capitalization usually rises each month and is computed 
at the end of the month, there is buying on day —3 to account for this. Buying 
was done as soon as possible. 

Large brokerage firms had a sales push that on day —3 and lasted 7 to 10 days. 
Employment stock holding plans and mutual funds received money in this period 


to invest, starting around day —3. 

Individual investors bought mutual funds with their pay, which they received on 
calendar days 15 to 25 of the month; the funds then invested in stocks with a lag, 
so most of the buying occurred on days —5 to +2. 

For low liquidity stocks, buying occurred over several days by dealing in accounts 


to minimize price pressure effects. 


Using data on the NSA from 1949 to 1988 Ziemba (1991) found that all of the days 
—5 to +2 had significantly positive returns. As in the U.S. all the gains occurred in 
the first half of the month and the second half had zero or negative returns. 

Ziemba (1989) investigated the futures market trading outside Japan on the 
Simex in Singapore on the turn of the month and other anomalous effects in 
Japanese security markets during the period September 1986 to September 1988 
before there was futures trading on the NSA or Topix in Japan. He found that the 
spot effect was consistent with past data so the futures market did not alter the 
effect. However, the futures market in Singapore totally anticipated the effect on 
days —8 to —5 with a total average rise on 2.8%. Then when the effect occurred on 
days —5 to +2 and the spot market gained 1.7%, the futures market was flat. 

In our update for the U.S. markets using S&P500 and Russell 2000 futures data 
from 1993-2010 and 2004-2010, we found that the TOM effect still exists with a 
bit of anticipation. Figures 1.27ab and 1.28ab and Tables A.8abcd document the 
results. For example, for the S&P500, for the longer sample and also for the shorter 
more recent data, the days —5 to +2 all have positive returns except —1 and —2 
(which have small mean losses). For the Russell 2000 the same days all have positive 
mean returns for the longer sample and for the more recent data with —2 having a 
slightly negative mean. 


1.11 Open/Close Daily Trade on the Open 


Branch and Ma (2006) analyzes an anomaly that may contradict the weak form of 
the efficient market hypothesis. This hypothesis holds that a time series of returns 
should not contain meaningful autocorrelation, or, in other words, that a stocks past 
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Figure 1.27: S&P500 Futures Average Daily Returns During TOM by Day. 


returns should not have predictive power about future returns. In contradiction, 
Branch et al. find a very strong negative autocorrelation between the overnight 
return (between the close of the market and its opening the next day) and the 
intraday return (the portion that occurs during the day while the market is open). 
The study analyzes stocks on the NYSE, AMEX, and NASDAQ over two periods 
between 1994 and 2005, as well as broken down into size categories, and finds 
statistically significant results across each sub-sample. Branch et al. goes further to 
hypothesize that the cause of this anomaly is related to the behavior of specialists 
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Figure 1.28: Russell 2000 Futures Average Daily Returns During TOM by Day. 


or market makers and their strategies and incentives on how to open their assigned 
stocks relative to the previous days closing price. 

Branch et al. also consider whether there is a strategy that exploits this anomaly 
but offer only initial conclusions. According to their explanation of the basis of the 
anomaly, they reason that only market makers are currently able to exploit this 
anomaly because they have the benefit of knowing the balance of overnight orders. 
For the trading public, their main actionable conclusion for this analysis is advice 
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against a public trader placing an order to be executed at opening, which they 
equate to putting in an order at a disadvantageous price. 

Cooper, Cliff and Gulen (2008) exhibits that the US equity premium return over 
the decade from 1993 to 2006 has solely been a result of overnight returns, with 
intraday returns being close to zero. While past studies had shown that daily market 
closures had a relatively clear effect on trading volume and stock price volatility, the 
implications as far as return timing did not carry a consensus. Cooper et al. offer 
strong evidence supporting the hypothesis that the majority of returns are made 
when markets are closed. The find the difference between night and day returns 
is between 2.61 and 7.61 basis points per day and that these results are robust 
across asset types, sub-periods, and markets (including NYSE, AMEX, NASDAQ, 
and Chicago Mercantile Exchange). The study finds that typical explanations such 
as risk, earnings surprises, and illiquidity do not substantially explain this pattern 
and instead imply there is an inefficiency in market opening and closing mecha- 
nisms. Particularly they suggest this may carry tradable implications for portfolio 
managers, particularly those with low marginal trading costs. 


1.12 Industry Concentration 


Hou and Robinson (2005) analyze the asset pricing implications of industry market 
structure by running regressions of average monthly returns on industry concentra- 
tion from 1963 to 2001. They conclude that firms in the least concentrated industries 
(18* quintile) earn 4% per year higher returns than firms in the most concentrated 
(5*" quintile). The greater risk is called the concentration premium and is described 
by innovation and distress risk. The metric used to quantify industry concentration 
is the Herfindahl index, a relative measure of industry market share dilution, as 
defined by the three-digit SIC classification. They also find that the concentration 
premium is greater in industries with a higher book-to-value ratio. 

Investors could possibly set up a long-short strategy that is long in firms or 
industries with low concentration and short in firms or industries with high concen- 
tration. The 4% advantage is not that great so given transaction costs, this edge 
might be combined with other strategy edges. 


1.12.1 Weather: Sun, rain, snow, moon and the stars and clouds 


Hirschleifer and Shumway (2003) point out that psychologists have documented 
correlation between sunshine and behavior for decades. Sunshine has been linked to 
tipping (Rind, 1996) and lack of sunshine to depression (Eagles, 1994) and suicide 
(Tietjen and Kripke, 1994). People feel good when the sun shines and when they 
feel good they are more optimistic and may be more inclined to buy stocks thus 
leading to higher stock prices. 

Roll (1992) argues that 


“weather is a genuinely exogenous economic factor... it was a favorite example of an 
exogenous identifying variable in the early econometrics literature ... because weather 


64 Calendar Anomalies and Arbitrage 


is both exogenous and unambiguously observable... weather data should be useful 
in assessing the information processing ability of financial markets.” 


See also Trombley (1997), Kramer and Runde (1997) and Loughran and Schulz 
(2004). 

Hirshleifer and Shumway (2003), following the earlier paper by Saunders (1993), 
examine the relation between morning sunshine at a country’s leading stock 
exchange and market index stock returns that day at 26 stock exchanges interna- 
tionally from 1982-1997. They find that sunshine is strongly positively correlated 
with daily stock returns. After controlling for sunshine, other weather conditions 
such as rain and snow are unrelated to returns. Keef and Roush (2007), using data 
on 26 international stock exchanges show the not surprising result that the sun- 
shine effect is monotone stronger the further one is from the equator and the per 
capita GDP. There is no effect at the equator and a big effect on northern stock 
exchanges. An interesting question for future research, that we are working on in 
a forthcoming paper, is does the sunshine effect have anything to do with “sell in 
May and go away” that suggests being out of the stock market in the sunniest time 
of the year? 

Yuan, Zheng and Zhu (2006) investigate the relationship between lunar phases 
and stock market returns in 48 countries. Stock returns are lower on the days around 
a full moon compared to the days around a new moon. The return difference is 3-5% 
per year based on equal and value weighted global portfolios. The result is not due 
to changes in stock market volatility or trading volumes. Also the lunar effect is not 
explained by announcements of macroeconomic factors or major global shocks, and 
is independent of calendar anomalies such as January, day of the week, calendar 
month or holiday effects. 


1.13 Conclusions and Final Remarks 


In the main, the anomalies are still there with some moving around. In the past, 
some of the anomalies such as the TOM and January effect had very high prediction 
accuracy. Currently the January barometer and sell in May and go away which deal 
with longer range predictions have similar reliability. Other anomalies such as the 
January and holiday effects still exist and add value. The monthly effect has become 
noise and has no predictive value. 

How did the anomalies do in practice this year? 

The TOY trades are in addition to the predominately S&P500 and Russell 
2000 option biases strategies but have added value and were just used in December 
2011. Dzhabarov and Ziemba have run a private account with various anomalies — 
seasonal and option biases, including the TOY trades in December 2009, 2010 and 
2011. The TOY effects are shown in the graph in Figure 1.29. The TOY trades can 
be seen at the far left and far right and in the middle of the wealth graph. 

What did Ziemba learn from Hirsch and Hirsch (2011)? 
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e=e\VTZMI Trading NAV Net of Commissions and 2% + 20% Fees 


—e-Mini S&P 500 futures B&H NAV 


start Dec 1, 2009 $100,000 
as of Dec 31, 2011 $147,795 


Figure 1.29: Private Account Managed by C. S. Dzhabarov and W. T. Ziemba, December 1, 2009 
to December 31, 2011. 


The following are some conclusions from the 2011 edition of the Stock Trader’s 
Almanac plus comments regarding how this compares to our own research. (From 
“The best of times for stocks may be ahead,” CNBC, October 26, 2010). 


1. The markets sweet spot during any four-year presidency cycle is historically the 
fourth quarter of the midterm year and first quarter of the pre-election year. 
Also there has not been a down third year of a presidential term since prewar 
1939 when the Dow Jones Average fell —2.3%. This agrees with our Presidential 
election calculations. 

2. History also shows the markets have historically performed the best when there 
is a Democrat in the White House and a GOP-controlled Congress, which is what 
we now have. Over the last 70 years, in fact, such combinations on Capitol Hill 
produced an average annual return of 15% or more. 

This is consistent with our calculation but we did not specifically investigate 
Democrat in White House with a Republican Congress. 

3. The months between November and April are generally good to investors. Since 
1950, the Dow Industrials have returned an average 7.4% between those months, 
versus less than 1% (0.4%) between the May—October period. A hypothetical 
$10,000 investment during the best six months of the year over that 60-year 
period would have yielded a $527,388 profit, compared with a $474 loss for the 
worst performing six months of the year during the same period. 

This is the basic yearly seasonality and it agrees with our calculations. 

4. They argue that there would be no January barometer without the passage in 
1933 of the 20° (Lame Duck) Amendment to the constitution. Prior to the 
1933 Congress and the President took office in March following an election and 
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Congress did not need to meet until December. So the period from the Novem- 
ber election to March created a four month Lame Duck. With the amendment 
congressional sessions begin on January 3°¢ and presidents are inaugurated on 
January 20°. This narrows the Lame Duck session and moves many important 
things into January including the president’s state of the union address and 
annual budget setting priorities for the year and thus creating a momentum for 
the rest of the year. 


. From Yahoo! Finance, October 20, 2010: 


If history is any guide, the secular bear market that began in 2000 still has about 
seven more years before the bulls take over. And when they do, get ready for 
what Hirsch describes as a “super boom” in stocks. 

“The super boom’s not really going to kick off until 2017 after we shake out all 
this financial crisis,” according to the Hirsches. They predict that the Dow will 
hit 38,820 in 2025, a 500% gain from the intraday lows of March 2009. 

“From the last bottom in 1974 it took eight years before the market really took 
off in 1982 and then another eight to move up the rest of the 500%, in line with 
Yale Hirsch’s prediction in 1976 for a 500% market move by 1990. A 500% rise 
in the Dow over 16 years from the intraday low of 6470 on March 6, 2009 would 
put the Dow at 38,820 in 2025.” 

It’s more than just wishful thinking — it’s a return to the market’s norms. It 
only takes a 7%-8% gain, compounded annually, to get to 38,820, they note. In 
fact, the Dow rose 1,400% during the bull market of 1982 to 2000. 

What, then, will drive this super boom? 

First, the end of war brings peace and a generally pro-business environment. It 
happened after World War II and Vietnam, and the Hirsches bet that it will 
happen after the wars in Iraq and Afghanistan. 

Second, inflation. “After all the major wars of the 20th century,” stocks took off, 
“when the inflation from government spending kicks in.” 

Third, enabling technologies. They point to the assembly line after World War 
I, which revolutionized business; suburban expansion after World War II that 
created a middle-class consumer in need of appliances and more homes; and 
after Vietnam it was IT and communications that sparked innovation. Renewable 
energy and biotechnology, they believe, will be the next big thing. 


Well, we will see. This is the story of compounding. So if earnings grow by 7-8% 
per year, the stock prices may well have these gains. 


Table A.1: Russell 2000 —S&P500 Futures Spread Average Returns by Month, 1993-2011. 


MOY Trading R2000-S&P500 futures spread. Geometrically Linked Returns. 
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Table A.2: 


S&P500 Futures Average Monthly Returns, 
MOY Trading in S&P 500 futures contract. Geometrically Linked Returns, 


1993-2011 and 2004-2011. 
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Table A.3: 


Russell 2000 Futures Average Monthly Returns, 1993-2011 and 2004-2011. 
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TOM Trading in S&P 500 futures. Geometrically Linked Returns. 


Table A.4: S&P500 Futures Turn-of-the-Month Returns, 1993-2011 
M Trading in S&P 500 futures. Geometrically Linked Returns. 
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Table A.5: Russell 2000 Futures Turn-of-the-Month Returns, 1993-2011 and 2004-2011. 
TOM Trading in Russell 2000 futures. Geometrically Linked Returns, Entry on close 5th Business Day before End of Month. 
Exit on close 2nd Business Day of Next Month. 


193| 1994| 1995| 199%., 1997| 19%, 199| 2000| 2001. 2002| 2003| 2004| 2005 2006 2007| © 2008) 2009, 2010 2011 Average [StDev t -A 
Jan 0.99% | 0.25%| -0.77% 2.01%) 3.21% 4.04%) -1.51%| 3.73% 2.20%] 0.89%) 2.91%) -2.67% 0.22% 1.20%] 5.90%) 681%) 051%| -0.79% 0.96% 0.028 1.441, 
Feb 259%| 0.03% 206% O71%| 411% -0.67%| -253%) 0.11% -183%| 0.91%) 4.04%] 4.05%, 0.85% 3.19%] 4.62% 0.45%] -0.62%| 213% 0.90% 0.024) 1.607) 


1.70%| 025% -1.23%| 1.31%) -1.28%| 5.32%| -0.67% 485%) 0.72%) 3.26%| 3.05%) 1.09% -7.03%| 4.37%] -1049%| 252%] -041% -0.08% 0.039) -0.087) 
0.28%} 1.56% -4.00%| 216%) 2.40%| -12.96%| -5.56% -0.15%| 0.93%) 5.75%] -0.15% 1.00% 048%| 1.08%| 581%| 0.72%) 3.23% -0.16% 0.043] -0.163) 
0.70% 0.54% 5.54%| 0.72% -1,57%| 10.28%] 6.07% 1.19%| 381%| 4.17%] 0.66% 0.19% -0.20% 1.98% 5.25%) -3.67%| 0.13% 1.72% 0.035, 2,139 
O74%| 0.61% 085%| -3.16%| -085%| 10.05%) -081% 4.28%] 6.15%) 3.83%] 214% 137% 345%] 1.85%] 10.32%] 297%] 1.21% — 1.92% 0.038) 2.208) 
0.96%| -0.75% 0.30%! 0.97%) 2.71%] 0.63%] 239% -6.26%| 4.50%) 0.76%] 4.20% 4.37% 2.26%| 5.28% 0.77% -6.06%| 5.83% 0.67% 0.034) 0.855) 
0.67%] 380% 1.64%] 842%) -1.48%] 3.21%) 254% -0.76%| 1.23%] 1.69%] 211%) 045% -336%| 068%] 3.83%) -1.07%| -8.18% -0.52% 0.034] 0.009) 
0.87%! 083% 222%] -9.71%) -298% 2.33% 3.61% 6.62% 467%| 246%] 080% 338% 047%| 277%| 4.76%] 444%] 0.66% -0.03% 0.038) -0.031' 
268%] 1.51% 221%] 6.16% 265%) -203%| 230%) 1.61%) -046%| 4.11%] 0.20%) -143%] 285% 8.92%| -350%| 0.18%] -386% -0.58% 0.034) -0.742 
Nov 1.51%| 1.42%| 1.90%| 0.80%. 7.21%| 3.68% 3.50%| 6.84%) -140% 342%] 438%| 206%| 210%| -273% -1.70%| 22.54%| -4.63%| 0.86%] 2.20% 276% 0.057) 2.125 
Dec) 1.88%] 1.06%] 4.23%| 226% -0.49%| 046%) 1.41%| -4.75%| 1.74%) 048%] 264%| 1.79%| 1.11%| 072%) 262%| 065%| 061%| 213%| 878% 1.49% 0.025 2581] 


Average 1.48% 0.42% 0.69% 0.78% 1.41% -0.98% 066% 0.60% 0.58% -051% 221% 1.70% 1.36% 0.79% 0.35% 086% 087% 0.24% 0.9% 0.75% 
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t 3.39%| 0.736| 1.463| 1818 1.672) -0.711, 0961| 0.309) 0.617) -0.486| 3.100) 1.959) 2347 1471 0.394) 0.374| 0.508) 0.290| 0.731 3.033) 
Geomr | 15.68%} 4.97% | 8.43%) 9.66% 17.80%) -1229% 7.86%) 4.79%) 6.52% -6.68%| 29.52%) 21.85%| 17.31% 9.70% 3.77%| 7.35%) 8.85%) 247%) 10.38% 
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Table A.6: Data by Month for Sell in May and Go Away Versus Buy and Hold for the S&P500, 1993-2011. 
SIM Trading in SP500 futures contract. Geometrically Linked Returns. Entry at Close on 6th Business Day before End of October, Exit Close of 1st Business Day of May. 


T 


2003 2004 2008) 2009| 2010, 2011/Average StDev t 
Jan 149 69%| O. 2.98% 1.82% ~682%| -9.14%| 3.72%) 230% -0.28% 0.041 
6.30%| 3.76% 173%) 1.28% -3.63%| -11.13%| 2.98% “1.21% 0.083 
5.65%] 4.49% 044% -182% -0.85%| 7.28%] 557% 1.51% 0.042 
0.70%) 319% 8.00% -1.73% : 455%] 9.07%| 147% 1.85% 0.040 
1.35%| 201% 0.02% 0.97% | 1.99% 0.71%] 1.29% 0.81% 0.007) 
0.49%) 0.43% 0.10% 0.09% 0.17%) 0.02%| 0.02%) 0.01% 0.28% 0.002 
047%] 041% 0.09% 011% 7 0.17%| 0.01%! 001% 
047%) 044% 0.08% 0.13% 0.16%| 0.01%) 0.02% 
0.45%) 043% 0.09%. 0.13% 0.16% 0.01%! 0.02% 
16%| -0.07% 
5.06%) 1.04% 0.75% 383% -9.22%| 588%| -0.11% 
| 180%] -1.93% 6.92%) 6.60% 482% 334% 027%) 1. -040%| 1.40%) 6.18% 
1.64% 115% 1. 2.60% 2.00% 0.96% 0.95% -0.24% 0.00% 1.14% 
ooa) 0.021! o. 0026| 0.029 0.029. 0.019 I 0.051] 0060| 0.027 
1.868 3.463) 2430|) 0. L149 1776 D160) -0001| 1.459 


14.40% | 35.62% | 26.34% ‘40% | 11.67% 1186%| 1. -4.19%| -202% 


ntract. Geometrically Linked Returns. 
1997. 1998) 1999 2003 2004 2008. 2009) 2010. 2011 Average StDev 
5.69%| 0.73% : -298% 1.82% -682%| -9.14%| -3.72%| 230% -0.28% 0.041 
0.28%| 6.30% 173%) 1.28% -3.63%| -11.13%| 2.98%] 335% -1.21% 0.043 
424%! 5.65% 0.44% -182% -0.85%| 7.28%| 5.57%| -0.51% 151% 0.042 


5.69% 0.70% “| 8: : : 9.07%| 147%) 289% 1.85% 0.040 
5.84%] -2.60% 96%] 5.27%| -8.41%| -1.21% 
452%| 461% 5.94%] -223% 0.040 
7.47%] -1.86% ; 734%| 6.80%) -216% 
5.91%) -15.59% 05% 4%] 0. 3.46% 4.69%) -6.40% 
551%| 6.86% ji f 3.16%| 831%| -7.85% 
378%| 738%) 573% 551%. 132% 00% | -20.11%| -207%| 3.74%) 10.51% 
3.21%| 506%] 1.04% 0.75% 3.83% -922%| 588%] -0.11%| -0.61% 
240%] 6.92%) 6.60% 482% 334%) 0: 1.80%) -0.56%| -0.40%| 140%] 6.18%] 037% 
2.22% 20% 140% 189% 0.72% 1.06% 0.21% -452% 1.67% 1.02% 
0.046| o066) 0.039| o. ; 0.033) 0.021 0.017| 0.029| 0.067) 0.064| 0.056 
1687| 1.063) 1.229 L97 1172 2135| 0257| -2321| 0.903 0.634 


33.54% | 19.34% | 28.75%| 23.93%) 17.18%| -1273%| -15.61%| -26.10%| 24.53% 8.76%| 285%| 13.29%| 2.11% | -44.22%| 19.28% 
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Table A.7: Data by Month for Sell in May and Go Away Versus Buy and Hold for the Russell 2000 Futures, 1993-2011. 


SIM Trading in Russell 2000 futures contract. Geometrically Linked Returns. Entry at Close on 6th Business Day before End of October, Exit 1st Business Day of May. 
T 


1994 1996, 1997 1998, 1999 2000 2011 Average StDev 


2.95%] -1.60%| -0.32%| 2.04%] -248%| 0.32%) -3.17% -041% -0.96% 0.047 
-143%| 4. L58% | -287%| 7.31%| -8.02%| 16.64% | 5.34% -0.34% 0.064 
443% 348%| -4.20%| 5.20%| 1.15%) -6.53% 208% 1.32% 0.043 
-0.55%| 23 4.59%| -0.10%| -0.13%| 847%] -7.92% 256% 188% 0.056 
1.01%) 1.65%] 0. 1.07%] 1.70%] 0.94%] 1.58%) 3.29% -131% 0.94% 0.010 
0.27%) 0.35%) 0. 040%| 047%) 049%/ 043%) 054% 0.01% 0.28% 0.002 
0.25%| 0.34%] 0: 049%| 047%) 0.47%| 041%) 0.56% 0.01% 0.29% 0.002 
0.27%) 0.40%) 0. 043%! 0.44%) 047%! 0.44%) 0.55% | 0.01% 0.29%  0.002/ 
0.25%) 0.39%] 0. 045%] 049%| 045%| 043%) 0.52%| 0.24%) 0.15%) O 0.01% 0.28% 0.002 
147%| 2.27%) 0.13%| -0.25%| 5.75% 1.82%| 3.11%] 5.39%] 214%] 0.12% 088% 1.88% 0.042 
4.28%) 4.75%) 4. 3.89%| -1.16%| 4.28%] 4.69%) -10.54%| 7.38%! 8.50% “1.00% 0.71% — 0.064 
412% 281%] 3. 201%] 2.05%| 6.76%| 1258%| 8.02%] 5.94%| -5.87% | 0.03% 3.47% 0.040 


Average -0.02% 0.00% 1. 1.49% 0.42% 213% 213% 0.61% 1.09% 0.67% 1 0.68% 0.85% 
StDev a 0.025 J 0.017) 0.025} 0.030} 0.050) 0.074] 0.043) 0.040 . . 0.018 
t : -0.029 


0.004) 2. 3.090 0579| 2432| 1.474) 0.286| = 0.866) 0.581 


Geomr | -0.57% | -0.31% 17. 19.18% | 4.83%| 28.12%| 27.12%) 4.49% 746% 


B&H Trading in Russell 2000 futures contract. Geometrically Linked Returns. 


1993, 1994 1995) 1996 1997) 1998; 1999 2000 2001 2011 Average StDev 
Jan 2.95% | -1.60%] -0.32%| 2.04%) -248%| 0.32%) -3.17%| 3.91% ji -0.41% -0.96% 0.047 
Feb| -4.20%| -1.43%| 4.17%| 1.58%| -2.87%| 7.31%| -8.02%| 16.64%| -7.33% 3 5.34% -0.40% 0.064 
259%] -4.43%| 138%| 3.48%] -4.20%| 5.20%| 1.15%] -6.53%] -3.92% 208% 132% 0.043 
3.05%! -0.55%| 239%| 4.59%] -0.10%| -0.13%| 8.47%) -7.92%| 6.33% 256% 188% 0.056 
4.00% | -1.16%| 2.21%| 3.17% 10.25%) 6.34%) 1.64%) -7.30%| 1.83% j| %| . -2.02% 0.80% 0.055 
151%|-349%| 4.64%] -3.66%| 4.75%| 1.13%| 4.66%) 9.34%] 3.00% -283% 0.44% 0.047 
055%) 2.00%) 528%] -932%| 4.13%) -9.34% 388%) 352%) -504% | 378% -1.39% 0.068 
3.95%| 5.42%) 144%) 5.01% 1.98% | -20.31% | 3.99%] 6.08% -3.70% i -10.27% -0.64% 0.065} 
247% | -0.58%| 182%| 4.17%] 7.70%] 8.02%| 0.36%] -1.99%| -13.82% “12.25% 0.15% 0.068 
2.96% | -0.40%| -5.11%| -1.50%| -513%| 3.75% 044%) -548%| 524% T414% -0.12% 0.078 
-128%| -4.75%| 442%| 3.89%] -1.16%| 4.24%| 4.69%] -10.54%| 7.38% “1.00% 0.71% 0.064 
4.12%| 284%| 3.13%) 201% 2.05% | 6.76%| 12.58% 8.02%| 5.94% -0.03% 3.47% 0.040 
0.97% -0.30% 2.02% 1.09% 1.62% -0.18% 1.53% -0.54% -0.02% -0.71% 0.45% 
0.033! 0.031) 0.029! 0.042| 0.047| 0.084! 0.056) 0.085) 0.066 . J I I . 0.069 
| 0.976) -0.329) 2.388] 0.896] 1.205] -0076| 0.948 -0219| -0.009 231) 1.098) 722| 0.924) -0.357 


|Geomr | 10.56% | -4.03%| 26.46% | 12.78%| 19.88%| -6.17%| 18.06%] -9.77%| -2.67% 15.38% | -4.95%| 42.69%) 18.43%] 21.75%) -10.51%] | | | 


SIYDUOUY LOPUD — woyonpoujuy :F 4azdvyQ 


el 


Count 
StDev 
Average 
z 
Positive 


Count 
StDev 
Average 
z 
Positive 


StDev 


Average 
z 
Positive 


Monthly 


28 


0.0146 0.01 


0.17% 
0.6103 
57.1% 

-12 


95 
0.0154 
0.18% 
1.1299 
58.9% 

-11 


46 
0.0162 
0.11% 
0.4530 
52.2% 

-11 


167 
0.0144 
0.05% 
0.4820 
47.9% 

-10 


0.25% 


% 163 206 
14 00120 0.0117 


72 
0.0132 
0.03% 
0.1716 
51.4% 

-10 


208 
0.0147 
-0.06% 
-0.6003 
49.0% 
-9 


0.07% -0.08% 
0.7823 -0.9972 
528% 485% 

-10 -9 


89 


225 225 226 
0.0112 0.0126 0.0128 
0.05% -0.07% -0.16% 
-0.6016 -0.8761 -1.8867 


493% 489% 


8 


7 


46.9% 


7 4 


7 


0.0133 0.0123 0.0150 
-0.08% -0.09% 0.00% 
-0.5809 -0.7024 -0.0092 
55.1% 526% 557% 


-9 


-$ 


-7 


2235 225 226 


0.0153 


0.0147 


-0.09% -0.08% 
-0.8813 -0.8028 


50.7% 
8 


49.8% 
7 


0.0140 
-0.08% 
-0.8300 
44.2% 
6 


97 


0.0133 0.0117 


227 
0.0120 
0.0% 
0.5326 
54.2% 

5 


7 


-0.17% -0.07% 


-1,2446 -0.6083 


49.5% 48.5% 


4 


226 
0.0143 


5 


227 
0.0140 


7 
0.0161 
0.31% 
1.9029 
56.7% 
4 


0.0166 


227 


227 


0.0137 0.0117 


0.13% 


0.08% 


1.3802 0.9724 


53.3% 
4 


7 


0.32% 


4 


227 
0.0156 
0.18% 
1.7580 
60.8% 

3 


97 
0.0188 
0.38% 
1.9818 
61.9% 
-3 


0.20% 


55.1% 
3 


7 


-3 


226 
0.0148 
0.10% 
0.9965 
58.0% 

-2 


226 


0.0134 0.0117 
0.00% -0.04% 
0.0411 -0.5189 


51.3% 
-2 


96 


0.0116 00155 0.0113 
0.00% -0.05% 
1.8854 1.6717 -0.0157 -0.3957 
59.8% 61.9% 50.0% 46.9% 


-2 


225 226 26 26 226 22 26 26 22 26 22% 22 


4 


96 


-1 


0.0138 
0.26% 
2.8081 
478% 628% 


1 


96 
0.0161 


0.0124 


0.2332 
55.3% 


0.08% 


57.3% 526% 


2 3 


96 97 
0.0121 0.0112 0.0151 0.0117 0.0142 0.0159 0.0110 
0.00% -0.11% -0.09% -0.04% -0.06% -0.10% 
0.6760 -0.0159 -0.7248 -0.7635 -0.2770 -0.3646 -0.9072 
54.6% 474% 53.6% 577% 546% 


2 3 


226 2265 226 226 


0.0175 
0.08% 
0.7302 
59.3% 

1 


0.0154 
0.05% 
0.5051 
54.4% 

2 


0.0137 0.0150 
0.02% -0.06% 
0.2712 -0.5681 
51.8% 518% 

3 4 


97 97 
0.0151 0.0178 
-0.08% -0.17% 
-0.4977 -0.9532 
474% 53.6% 
3 4 


0.0111 0.0128 
0.02% 0.06% -0.04% 
0.7700 -0.5120 
54.9% 50.0% 


4 


97 


4 


226 
0.0144 
-0.05% 
-0.5461 

48.2% 
5 


97 
0.0168 
-0.21% 
-1.2491 
42.3% 
5 


0.01% 


97 


5 


5 


226 
0.0166 
-0.20% 


7 


6 


6 


226 
0.0162 
0.04% 


0.0115 0.0123 0.0137 0.0113 
0.07% -0.09% -0.04% 
0.1251 -0.8662 -0.%08 -0.5209 
504% 491% 544% 527% 


7 8 


97 97 


7 8 


226 
0.0144 
0.04% 


-1.8028 -0.3765 -0.3838 


50.4% 
6 


97 
0.0195 
0.14% 


50.9% 
7 


7 
0.0208 
-0.09% 


54.4% 
8 


7 
0.0158 
-0.05% 


-0.6879 -0.4274 -0.3403 


48.5% 
6 


53.6% 
7 


53.6% 
8 


0.0152 
0.17% 
1.6528 
55.3% 

9 


0.0120 
0.02% 
0.2433 
58.4% 

10 


0.0139 
0.15% 
1.6491 
60.2% 

n 


110-661 
‘oosa7ys (e) 


‘oosays (4) 


“TLOZ-P00Z PUR TTOZ-E66T ‘SUM yoy 


Aeq Seay somyng 0007 MOSSNY ‘O09dPS WOW- “gy ABL 


TL0Z—-F00z 


TLOG-€66T 


0002 messny (9) 


T10Z—-p007 
000z messny (p) 


tL 


abouiguy PUD sayvULOUy LOPUD 


Chapter 1: Introduction — Calendar Anomalies 75 


References 


Abraham, A. and D. L. Ikenberry (1994). The individual investor and the weekend effect. 
Journal of Financial and Quantitative Analysis 29(2), 263-277. 

Ariel, R. A. (1987). A monthly effect in stock returns. Journal of Financial Economics 18, 
161-174. 

Ariel, R. A. (1990). High stock returns before holidays: Existence and evidence on possible 
causes. Journal of Finance 45, 1611-1626. 

Arsad, Z. and J. A. Coutts (1997). Security price anomalies in the London international 
stock exchange: A 60 year perspective. Applied Financial Economics 7, 455-464. 
Banz, R. (1981). The relationship between return and market value of common stock. 

Journal of Financial Economics 9, 3-18. 

Bialkowski, J. P., A. Etebari, and T. P. Wisniewski (2009). Piety and profits: Stock market 
anomaly during the Muslim Holy Month. Finance and Corporate Governance Confer- 
ence 2010 Paper. Available at SSRN. 

Blume, M. E. and R. R. Stambaugh (1983). Biases in computed returns: An application 
to the size effect. Journal of Financial Economics 12, 387—404. 

Bohl, M. T. and C. A. Salm (2010). The other January effect: International evidence. 
European Journal of Finance 16, 173-182. 

Booth, D. G. and D. Keim (2000). Is there still a January effect? In D. B. Keim and 
W. T. Ziemba (Eds.), Security Market Imperfections in World Wide Equity Markets, 
pp. 169-178. Cambridge, UK: Cambridge University Press. 

Bouges, J. C., R. Jain, and Y. R. Puri (2009). American depository receipts and calendar 
anomalies. Applied Financial Economics 19(1), 17-25. 

Bouman, S. and B. Jacobsen (2002). The Halloween indicator, sell in May and go away: 
Another puzzle. American Economic Review 92(5), 618-1635. 

Branch, B. S. and A. Ma (2006). The overnight return, one more anomaly. Available at 
SSRN. 

Britten-Jones, M. (1999). The sampling error in estimates of mean-variance efficient port- 
folio weights. Journal of Finance 52(2), 655-671. 

Bronson, R. E. (2011). What January effect? Available at www.ritholtz.com/ 
blog/2011/02/what-january-effect-? 

Brown, L. D. and L. Luo (2006). The January barometer: Further evidence. Journal of 
Investing 15(1), 25-31. 

Cadsby, C. B. and M. Ratner (1992). Turn-of-the-month and pre-holiday effects on stock 
returns: Some international evidence. Journal of Banking and Finance 16, 497-509. 

Canestrelli, E. and W. T. Ziemba (2000). Calendar anomalies in the Italian stock market, 
1973-1993. In D. B. Keim and Ziemba (Eds.), Security Market Imperfections in World 
Wide Equity Markets, pp. 337-363. Cambridge, UK: Cambridge University Press. 

Cervera, A. and D. B. Keim (2000). High stock returns before holidays: International 
evidence and additional tests. In D. B. Keim and W. T. Ziemba (Eds.), Security 
Market Imperfections in World Wide Equity Markets, pp. 512-531. Cambridge, UK: 
Cambridge University Press. 

Chan, S. H., W.-K. Leung, and K. Wang (2004). The impact of institutional investors on 
the Monday seasonal. Journal of Business 77(4), 967-986. 

Chen, H. and V. Singal (2003). Role of speculative short sales in price formation: The case 
of the weekend effect. Journal of Finance 58(2), 685-706. 

Chopra, V. and W. T. Ziemba (1993). The effect of errors in mean and co-variance esti- 
mates on optimal portfolio choice. Journal of Portfolio Management (Winter), 6-11. 

Choudhry, T. (2000). Day of the week effect in emerging asian stock markets: Evidence 
from the GARCH model. Applied Financial Economics 10(3), 235-242. 


76 Calendar Anomalies and Arbitrage 


Chukwuogor-Ndu, C. (2006). Stock market returns analysis, day-of-the-week effect, volatil- 
ity of returns evidence from European financial markets 1997-2004. International 
Research Journal of Finance and Economics 1, 112-124. 

Clark, R. and W. T. Ziemba (1988). Playing the turn-of-the-year effect with index futures. 
Operations Research 35, 799-813. 

Connolly, R. A. (1989). An examination of the robustness of the weekend effect. Journal 
of Financial and Quantitative Analysis 24(2), 133-169. 

Cooper, M. J., M. T. Cliff, and H. Gulen (2008). Return differences between trading and 
non-trading hours: Like night and day. Available at SSRN. 

Cooper, M. S., J. J. McConnell, and A. V. Ovtchinnikov (2006). The other January effect. 
Journal of Financial Economics 82(2), 315-341. 

Cross, F. (1973). The behavior of stock price on Fridays and Mondays. Financial Analyst 
Journal 29, 67-69. 

Dimson, E. (Ed.) (1988). Stock Market Anomalies. Cambridge, UK: Cambridge University 
Press. 

Dimson, E. and P. Marsh (1999). Murphy’s Law and market anomalies. Journal of Portfolio 
Management 25, 53-69. 

Doeswijk, R. (2005). The optimism cycle: Sell in May. De Economist 156(2), 175-200. 

Dubois, M. and P. Louvet (1996). The day-of-the-week effect: The international evidence. 
Journal of Banking and Finance 20, 1463-1484. 

Dzhabarov, C. and W. T. Ziemba (2010). Do seasonal anomalies still work? Journal of 
Portfolio Management 86(3), 93-104. 

Dzhabaroyv, C. and W. T. Ziemba (2011). Seasonal anomalies. In L. Zacks (Ed.), The 
Handbook of Equity Market Anomalies, Chapter 9, pp. 205-264. New York: Wiley. 

Eagles, J. M. (1994). The relationship between mood and daily hours of sunlight in rapid 
cycling bipolar illness. Biological Psychiatry 36, 422—424. 

Easterday, K. E., Sen, and Stephan (2008). The persistence of the small firm/January 
effect: Is it consistent with investors’ learning and arbitrage efforts? The Quarterly 
Review of Economics and Finance 49(3), 1172-1193. 

Easton and Pinder (2007). A refutation of the existence of the other January effect. Inter- 
national Review of Finance 3-4(7), 89-104. 

Fama, E. F. (1970). Efficient capital markets: A review of theory and empirical work. 
Journal of Finance 25, 383-417. 

Fama, E. F. (1991). Efficient capital markets II. Journal of Finance 46(5), 1575-1617. 

Fama, E. F. (1998). Market efficiency, long-term returns and behavioral finance. Journal 
of Financial Economics 25, 383-417. 

Fama, E. F. and R. French (1992). The cross-section of expected stock returns. Journal of 
Finance 47, 427-465. 

Ferguson, M. F. and H. D. Witte (2006). Congress and the stock market. Available at 
SSRN. 

Fosback, N. G. (1976). Stock market logic. The Institute for Economic Research, Fort 
Lauderdale. 

French, K. (1980). Stock returns and the weekend effect. Journal of Financial Economics 8, 
55-70. 

Fung, W., D. A. Hsieh, N. Y. Naik, and T. Ramadorai (2006). Hedge funds: Performance, 
risk and capital formation. Technical report, London Business School. 

Garrett, I., M. Kamstra, and L. A. Kramer (2004). Winter blues and time variation in the 
price of risk. Journal of Empirical Finance 12(2), 291-316. 

Gergaud, O. and W. T. Ziemba (2012). Evaluating great traders. Journal of Portfolio 
Management (Summer, in press). 

Ghosh, S., G. Bhalla, and W. T. Ziemba (2007). The January barometer, 1926-2005. 
Technical Report, Sauder School of Business. 


Chapter 1: Introduction — Calendar Anomalies 77 


Gibbons, R. S. and P. Hess (1981). Day of the week effects and asset return. Journal of 
Business 54, 579-596. 

Gonzalez, M. (2006). Buying stock? Consider turn of the month effects. Wall Street 
Journal. 

Gorenstein, P. (2010). A “Super Boom” in stocks? Dow 38k by 2025, says the Stock 
Traders Almanac. Investing/Yahoo Finance. 

Gultekin, M. and N. B. Gultekin (1983a). Stock market seasonality: International evidence. 
Journal of Financial Economics 12, 469-481. 

Gultekin, M. N. and N. B. Gultekin (1983b). Stock market seasonality: International evi- 
dence. Journal of Financial Economics 12, 469-482. 

Hansen, P. R., A. Lunde, and J. M. Nason (2005). Testing the significance of calendar 
effects. Federal Reserve Bank of Atlanta, Working Paper No. 2005-2002. 

Harris, L. (1986). A transaction data study of weekly and intra daily patterns in stock 
returns. Journal of Financial Economics 16, 99-117. 

Haug, M. and M. Hirschey (2006). The January effect. Financial Analysts Journal 62(5), 
78-88. 

Hensel, C., G. Sick, and W. T. Ziemba (1994). The turn of the month effect in the S&P500, 

1926-1992. Review of Futures Markets 13(3), 827-856. 

Hensel, C., G. A. Sick, and W. T. Ziemba (1996). Investment results from exploiting turn- 

of-the-month-effects. Journal of Portfolio Management (Spring), 17-23. 

Hensel, C. and W. T. Ziemba (1995a). The January barometer. Journal of Investing 4(2), 
67—70-856. 

Hensel, C. and W. T. Ziemba (1995b). The January barometer: Swiss, European and global 

results. Finanzmarket and Portfolio Management 9(2), 187-196. 

Hensel, C. and W. T. Ziemba (1995c). US investment returns during democratic and 

republican administrations, 1928-1993. Financial Analysts Journal (March-April), 
61-69. 

Hensel, C. R. and W. T. Ziemba (2000a). Anticipation in the January effect in the U.S. 
futures markets. In D. B. Keim and Ziemba (Eds.), Security Market Imperfections 
in World Wide Equity Markets, pp. 179-202. Cambridge, UK: Cambridge University 
Press. 

Hensel, C. R. and W. T. Ziemba (2000b). How does Clinton stand up to history? U.S. 
investment returns and presidential party affiliations. In D. B. Keim and Ziemba 
(Eds.), Security Market Imperfections in World Wide Equity Markets, pp. 179-202. 
Cambridge, UK: Cambridge University Press. 

Herbst, A. F. and C. W. Slinkman (1984). Political-economic cycles in the U.S. stock 
market. Financial Analysts Journal 40(2), 38-44. 

Hirsch, J. A. and Y. Hirsch (2011). Stock Traders Almanac. New York: Wiley. 

Hirsch, Y. (1986). Don’t Sell Stocks on Monday. New York: Facts on File Publications. 

Hirshleifer, D. and T. Shumway (2003). Good day sunshine: Stock returns and the weather. 
The Journal of Finance 58(3), 1009-1032. 

Hobbs, G. R. and W. B. Riley (1984). Profiting from a presidential election. Financial 
Analysts Journal 40(2), 46-52. 

Hou, K. and D. T. Robinson (2006). Industry concentration and average stock returns. 
Journal of Finance 6(4), 1927-1956. 

Huang, R. D. (1985). Common stock returns and presidential elections. Financial Analysts 
Journal 41(2), 58-65. 

Hudson, R., K. Keasey, and K. Littler (2002). Why investors should be cautious of the 
academic approach to testing for stock market anomalies. Applied Financial Eco- 
nomics 12, 681-686. 

Jacobs, B. I. and K. N. Levy (1988a). Calendar anomalies: Abnormal returns at calendar 
turning points. Financial Analysts Journal 44, 28-39. 


78 Calendar Anomalies and Arbitrage 


Jacobs, B. I. and K. N. Levy (1988b). Disentangling equity return regularities: new insights 
and investment opportunities. Financial Analysts Journal 44, 18-43. 

Jacobs, B. I. and K. N. Levy (1988c). On the value of value. Financial Analysts Journal 44, 
47-62. 

Jacobsen, B., A. Mamun, and N. Visaltanachoti (2005). Seasonal, size and value anomalies. 
Available at SSRN. 

Jaffe, J. and R. Westerfield (1985a). Patterns in Japanese common stock returns: Day of 
the week and turn of the year effect. Journal of Financial Quantitative Analysis 20, 
261-272. 

Jaffe, J. and R. Westerfield (1985b). The week-end effect in common stock return: The 
international evidence. Journal of Finance 40, 433-454. 

Jaffe, J., R. L. Westerfield, and C. Ma (1989). A twist on the Monday effect in stock prices: 
Evidence from the US and foreign stock markets. Journal of Banking and Finance 13, 
641-650. 

Jagannathan, R., A. Malakhov, and D. Novikov (2006). Do hot hands persist among hedge 
fund managers? An empirical evaluation. Technical report, Northwestern University. 

Jegadeesh, N. and S. Titman (1993). Returns to buying winners and selling losers: Impli- 
cations for stock market efficiency. Journal of Finance, 65-91. 

Kamara, A. (1997). New evidence on the Monday seasonal in stock returns. Journal of 
Business 70(1), 63-84. 

Kamstra, M. J., L. A. Kramer, and M. D. Levi (2000). Losing sleep at the market: The 
daylight-savings anomaly. American Economic Review 90, 1005-1011. 

Kamstra, M. J., L. A. Kramer, and M. D. Levi (2003). Winter blues: A SAD stock market 
cycle. Federal Reserve Bank of Atlanta, Working Paper No. 2002-13a, Sauder School 
of Business, Working Paper. 

Kandel, S. and R. F. Stambaugh (1996). On the predictability of stock returns: An asset 
allocation perspective. Journal of Finance 51, 385—424. 

Kato, K. (1990). Weekly patterns in Japanese stock returns. Management Science 36, 
1031-1043. 

Kato, K., S. L. Schwartz, and W. T. Ziemba (1989). Day of the week effects in Japanese 
stocks. In E. J. Elton and M. J. Gruber (Eds.), Japanese Capital Markets, pp. 249-281. 
Ballinger. 

Keef, S. P. and M. L. Roush (2007). A meta-analysis of the international evidence of cloud 
cover on stock returns. Review of Accounting and Finance 6(3), 324-338. 

Keim, D. B. (1983). Size related anomalies and stock return seasonality: Further empirical 
evidence. Journal of Financial Economics 12, 3-32. 

Keim, D. B. (1989). Trading patterns, bid-ask spreads, and estimated security returns: 
The case of common stock returns at the turn of the year. Journal of Financial Eco- 
nomics 25, 7598. 

Keim, D. B. and R. F. Stambaugh (1984). A further investigation of the weekend effect in 
stock returns. Journal of Finance 39(3), 819-835. 

Keim, D. B. and W. T. Ziemba (Eds.) (2000). Security Market Imperfections in World 
Wide Equity Markets. Cambridge, UK: Cambridge University Press. 

Kramer, W. and R. Runde (1997). Stocks and the weather: An exercise in data mining or 
yet another capital market anomaly. Empirical Economics 22, 637-641. 

Lakonishok, J. and M. Levi (1982). Weekend effects on stock returns: A note. Journal of 
Finance 87, 883-889. 

Lakonishok, J. and E. Maberly (1990). The weekend effect: Trading patterns of individual 
and institutional investors. Journal of Finance 45(1), 231-243. 

Lakonishok, J. and S. Smidt (1988). Are seasonal anomalies real? A ninety-year perspec- 
tive. Review of Financial Studies 1, 403-425. 


Chapter 1: Introduction — Calendar Anomalies 79 


Lleo, S. and W. T. Ziemba (2012). Stock market crashes in 2006-2009: Were we able to 
predict them. Quantitative Finance, July, 1-35. 

Lo, A. W. and A. C. MacKinlay (1990). Data-snooping biases in tests of financial asset 
pricing models. Review of Financial Studies 3(3), 431—467. 

Loughran, T. and P. Schultz (2004). Weather, stock returns and the impact of localized 
trading behavior. Journal of Financial and Quantitative Analysis 39(2), 343-364. 
Lucey, B. M. and A. Pardo (2005). Why investors should not be cautious about the 
academic approach to testing for stock market anomalies. Applied Financial Eco- 

nomics 15, 165-171. 

MacLean, L. C., E. O. Thorp, Y. Zhao, and W. T. Ziemba (2011). How does the For- 

tune’s Formula-Kelly capital growth model perform? Journal of Portfolio Manage- 

ment (Summer, in press). 

MacLean, L. C., E. O. Thorp, and W. T. Ziemba (Eds.) (2011). The Kelly Investment 

Criterion: Theory and Practice. Singapore: World Scientific. 

Malkiel, B. (2011). A Random Walk Down Wall Street (11 ed.). New York: W. W. Norton. 

Marquering, W., J. Nisser, and T. Valla (2006). Disappearing anomalies: A dynamic anal- 

ysis of the persistence of anomalies. Applied Financial Economics 16, 291-302. 

Martikainen, T., J. Perttunen, and W. T. Ziemba (1994). The turn-of-the-month effect in 

the world’s stock markets, January 1988 — January 1990. Finanzmarket and Portfolio 

Management 8(1), 41-49. 

Merrill, A. A. (1966). Behavior of Prices on Wall Street. The Analysis Press. 

Patterson, S. (2010). The Quants: How a New Breed of Math Whizzes Conquered Wall 
Street and Nearly Destroyed It. Crown Business. 

Penman, S. H. (1987). The distribution of earnings news over time and seasonalities in 
aggregate stock returns. Journal of Financial Economics 18, 199-228. 

Reinganum, M. (1981). A misspecification of capital asset pricing: Empirical anomalies 
based on earnings yields and market values. Journal of Financial Economics 9, 19-46. 

Rendon, J. and W. T. Ziemba (2007). Is the January effect still alive in the futures markets? 
Finanzmarket and Portfolio Management 21, 381-396. 

Riley, Jr., W. B. and W. A. Luksetich (1980). The market prefers republicans: Myth or 
reality. Journal of Financial and Quantitative Analysis 15(3), 541-559. 

Rind, B. (1996). Effects of beliefs about weather conditions on tipping. Journal of Applied 
Social Psychology 26, 137-147. 

Ritter, J. (1988). The buying and selling behavior of individual investors at the turn of 
the year. Journal of Finance 43, 701-717. 

Rogalski, R. (1984). New findings regarding day of the week returns over trading and 
nontrading period. Journal of Finance 39, 1603-1614. 

Roll, R. (1983). Vas ist dat: The turn of the year effect and the return premia of small 
firms. Journal of Portfolio Management 9(Winter), 18-28. 

Roll, R. (1992). Weather. In P. Newman, M. Milgate, and J. Eatwell (Eds.), The New 
Palgrave Dictionary of Money and Finance. London: Macmillan Press. 

Roll, R. (1994). What every CFO should know about scientific progress in financial 
economics: What is known and what remains to be resolved. Financial Manage- 
ment (Summer), 69-75. 

Rosenberg, B., K. Reid, and R. Lanstein (1985). Persuasive evidence of market inefficiency. 
Journal of Portfolio Management 11, 9-16. 

Ross, S. A. (2005). Neoclassical Finance. Princeton: Princeton University Press. 

Rozeff and Kinney (1976). Capital market seasonality: The case of stock returns. Journal 
of Financial Economics 3, 379-402. 

Saunders, Jr, E. M. (1993). Stock prices and Wall Street weather. American Economic 
Review 88, 1337-1345. 


80 Calendar Anomalies and Arbitrage 


Schwartz, S. K. (2010). The best of times for stocks may be ahead. CNBC.com, Tuesday, 
October 26. 

Schwartz, S. L. and W. T. Ziemba (2000). Predicting returns on the Tokyo stock exchange. 
In D. B. Keim and W. T. Ziemba (Eds.), Security Market Imperfections in World Wide 
Equity Markets, pp. 492-511. Cambridge, UK: Cambridge University Press. 

Stoll, H. R. and R. E. Whaley (1983). Trading costs and the small firm effect. Journal of 
Financial Economics 12, 5779. 

Stone, D. and W. T. Ziemba (1993). Land and stock prices in Japan. Journal of Economic 
Perspectives (Summer), 149-165. 

Stoval, R. (1992). Forecasting stock market performance via the presidential cycle. Finan- 
cial Analysts Journal 48(3), 5-8. 

Sturm, R. (2009). The “other” January effect and the presidential election cycle. Applied 
Financial Economics 19, 1-9. 

Sullivan, R., A. Timmerman, and H. White (2001). Dangers of data mining: The case of 
calendar effects in stock returns. Journal of Econometrics 105(1), 249-286. 

Thaler, R. H. (1992). The Winners Curse. New York: The Free Press. 

Thorp, E. O. and Ziemba, W. T. (2011). A tale of five investors: Response to private letters 
from Professor Paul A. Samuelson. Working Paper. 

Tietjen, G. H. and D. R. Kripke (1994). Suicides in California (1968-1977) — absence of 
seasonality in Los Angeles and Sacramento counties. Psychiatric Research 53, 161-172. 

Trombley, M. A. (1997). Stock prices and Wall Street weather: Additional evidence. Quert- 
erly Journal of Business and Economics 36, 11-21. 

Wachtel, S. (1942). Certain observations on seasonal movements in stock prices. Journal 
of Business (April), 184-193. 

Wang, K., Y. Li, and J. Erickson (1997). A new look at the Monday effect. Journal of 
Finance 52, 2171-2186. 

Yuan, K., L. Zheng, and Q. Zhu (2006). Are investors moonstruck? Lunar phases and stock 
returns. Journal of Empirical Finance 13, 1-23. 

Zacks, L. (Ed.) (2011). The Handbook of Equity Market Anomalies. New York: Wiley. 

Ziemba, R. E. S. and W. T. Ziemba (2007). Scenarios for Risk Management and Global 

Investment Strategies. New York: Wiley. 

Ziemba, W. T. (1988a). Discussion of “the buying and selling behavior of individual 

investors at the turn of the year” by Jay R. Ritter. Journal of Finance 43, 717-719. 

Ziemba, W. T. (1988b). Synopsis of “playing the turn of the year effect with index futures.” 

OR/MS Today 15, 37-38. 

Ziemba, W. T. (1989). Seasonality effects in Japanese futures markets? In Rhee and Chang 

(Eds.), Research on Pacific Basin Security Markets. Amsterdam: North Holland. 

Ziemba, W. T. (1991a). Fundamental factors in japanese stock returns, 1979 to 1989. 

Investing (Winter), 73-79. 

Ziemba, W. T. (1991b). Japanese security market regularities: Monthly, turn of the month 

and year, holiday and Golden Week effects. Japan and the World Economy 8, 119-146. 

Ziemba, W. T. (1993). Comment on “Why a Weekend Effect?” Journal of Portfolio Man- 

agement (Winter), 93-99. 

Ziemba, W. T. (1994a). Investing in the turn-of-the-year effect in the futures markets. 

Interfaces (May-June). 

Ziemba, W. T. (1994b). Worldwide security market regularities. European Journal of Oper- 

ational Research 74, 198-229. 

Ziemba, W. T. (2005). The symmetric downside risk sharpe ratio and the evaluation of 

great investors and speculators. Journal of Portfolio Management (Fall), 108-122. 

Ziemba, W. T. (2006). The great investors: A way to evaluate them. Wilmott, May. 


Chapter 1: Introduction — Calendar Anomalies 81 


N 


iemba, W. T. (2012). Is the 60-40 stock-bond pension fund rule wise? Journal of Portfolio 
Management (in press). 

Ziemba, W. T., W. Bailey, and Y. Hamao (Eds.) (1991). Japanese Financial Market 
Research. Amsterdam: North Holland. 

Ziemba, W. T. and C. Hensel (1994). Worldwide security market anomalies. In S. Howi- 
son, F. Kelly, and P. Wilmott (Eds.), Philosophical Transactions of the Royal Society, 
Series B, Number April, pp. 495-509. 

Ziemba, W. T. and S. L. Schwartz (1991a). The growth in the Japanese stock market, 1949- 
1990 and prospects for the future. Managerial and Decision Economics 12, 183-195. 
Ziemba, W. T. and S. L. Schwartz (1991b). Invest Japan. Chicago: Probus Publishing. 
Ziemba, W. T. and S. L. Schwartz (1992). Power Japan. Chicago: Probus Publishing. 
Zweig, M. (1986). Winning on Wall Street. New York: Warner Books. 


This page intentionally left blank 


OR PR ICE 


Chapter 2 


PLAYING THE TURN-OF-THE-YEAR EFFECT 
WITH INDEX FUTURES 


ROSS CLARK 


Midland Doherty Limited, Vancouver, British Columbia 


WILLIAM T. ZIEMBA 


University of British Columbia, Vancouver, British Columbia 
(Received March 1987; revision received August 1987; accepted August 1987) 


The “turn-of-the-year” effect is a well-documented stock market phenomenon in which low capitalization “small stocks” 
receive relatively higher returns than high capitalization “big stocks” on the last trading day of December and the first 8 
trading days of January. The difference in returns during this period is of the order of 10%. Strategies for buying and 
selling these small stocks may be profitable, but may also incur large transaction costs that eliminate most or all of the 
projected gains. In this paper, we show a preferable way to invest in order to exploit this anomaly: use a futures spread 
that is long in the small stocks and short in the big stocks. The optimal investment, which uses a modification of the 
capital growth criterion, is large and has a substantial expected gain with minimal risk. We have used this analysis 


successfully in managing investment accounts. 


his paper reviews the literature on the small-firm, 

turn-of-the-year effect and provides an invest- 
ment strategy for exploiting this empirical regularity. 
We begin by discussing the actual returns received 
from various investments over the last 60 years. The 
evidence indicates that small firms—that is, those with 
low capitalizations—greatly outperform other invest- 
ments. Most of the excess gains over larger capitali- 
zation firms occur in January, particularly in the first 
few days of the month. We examine this evidence and 
try to explain why this anomaly occurs and is so 
regular. (Indeed, it has been one of the most consistent 
of the stock market empirical inefficiencies.) 

William Ziemba’s interest in the facts and invest- 
ment potential of anomalies in the stock market arose 
in connection with a workshop he gave at the Los 
Angeles ORSA/TIMS meeting in April 1986 (Ziemba 
1986) and with his forthcoming book on market 
anomalies (Ziemba 1988). Ross Clark has been a 
commodity trader interested in technical and other 
speculative investment strategies. Together we have 
developed some decision rules on when and how 
much should be invested in the tum-of-the-year play, 
using index futures. We have used the strategy suc- 
cessfully in a number of private investment accounts. 


Subject classification: 197, 213 turn-of-the-year effect. 
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We first discuss the evidence for the turn-of-the- 
year effect, and then present our analysis of strategies 
for taking advantage of it. Finally, we describe our 
most recent experience in applying our preferred 
strategy. 


1. The Evidence 


Ibbotson Associates (1986) have considered the actual 
returns received from investments in United States 
assets with different levels of risk during the period 
1926-1985 (Figure 1). The results indicate that: 


* So-called “riskless investments”—T-bills, essen- 
tially—earn the rate of inflation, which averaged 
3.1% in those 60 years; their real rate of return is 
near zero, namely 0.3%: $1 in 1926 grew to $7.47 
by the end of 1985, with the price level at 6.10 (all 
rates of return are geometric annual averages). 

« Long-term government bonds perform similarly; 
they earn the rate of inflation plus 1% per year: $1 
grew to $11.03. 

+ Long-term corporate bonds have a rate of return of 
1.7% over inflation, or 4.8%: $1 in 1926 grew to 
$16.55 by the end of 1985. 
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Figure 1. Wealth indices for investments in the 
U.S. capital markets (1926-1985). Year-end 
1925 = 1.00. (Source: Ibbotson Associates.) 


+ Common stocks, as measured by the S&P 500 com- 
posite index, returned 6.7% plus inflation, or 9.8%, 
with 1926’s dollar being worth $279.12 in 1985. 

+ Small stocks, those New York Stock Exchange se- 
curities in the bottom fifth in capitalization, re- 
turned 9.5% plus inflation, or 12.6%; for these 
stocks, $1 grew to $1,241.24 in the 60 years. 


While small stocks outperformed common stocks by 
more than 4 to | in terms of cumulative wealth levels, 
their advantage has accrued only in the last two dec- 
ades, and most of the gains are in the 1974+ bull 
market. These returns are calculated on before-tax 
income, so that net return after taxes, adjusted for 
inflation for “riskless” investments and bonds, may 
well be negative for many investors. 

The difference in the total value of small-versus 
large-stock portfolio values is striking. Reinganum 
(1983a) computed the average stock price return by 
capitalization (Table I), with yearly rebalancing for 
the 18 years, 1963-1980, The results are overstated 
because Reinganum used daily data which has a bid- 
asked spread bias. 

Small stocks outperformed large stocks by more 
than 10 to 1 in this period. (Investors of small capi- 
talized stocks typically must sell at the bid and buy at 
the asked. Also, they move the market; see Stoll and 
Whaley 1983. Hence the average annual return of 
Portfolio 1 is probably overstated by 5-6%, so that 


the 10 to 1 edge, if properly measured, is more like 
4 to 1, which is less dramatic but still substantial.) 
During 1981-1986 (May), small stocks {the bottom 
two deciles) continued to outperform common stocks 
as a whole: they rose to $2.7737 per $1 by the end of 
1985 versus $2.3503 for common stocks (Ibbotson 
1986). Small stocks did better in 1981-1983. How- 
ever, common stocks, as measured by the S&P 500 
index, performed better in 1984-1987 (July). 

The higher returns for small stocks and for common 
stocks, as opposed to bonds or T-bills, were achieved 
by bearing greater risk. Table II shows Ibbotson’s data 
for the standard deviation of returns and the number 
of losing years during 1926-1985, 

Banz and Breen (1986) have pointed out that the 
Ibbotson data, as well as other data series based on 
the COMPUSTAT tape, suffer from at least two biases 
besides the bid-ask spread just mentioned. An ex-post- 
selection bias arises because the data base contains 
only companies that are currently viable, and excludes 
those that have merged, filed for bankruptcy or oth- 
erwise ceased to exist. Also, new companies enter the 
data base with a full history but without any data in 
the file at earlier dates. The look-ahead bias refers to 
the fact that data reported at particular times are 
usually not available to investors until dates in the 
next year. Banz and Breen have constructed a data 
base free of these biases. Their tests show a bias in the 
ordinary COMPUSTAT tape that favors small stocks. 
Hence the Ibbotson results are biased and slightly 
overstate the advantage of small stocks over medium- 
and high-capitalization stocks. The exact extent of the 
combined biases is not known. Note that tests based 
on CRSP data such as Reinganum’s do not suffer 
from these biases, 

Since 1981, the academic world has been fascinated 
by this small-firm effect and other anomalies. See 
Seligman (1983), Schwert (1983), Dimson (1986), 


Table I 
Average Stock Price Return by Capitalization 


Portfolio Decile edge Annes 
1 (smallest) +23.7 $452,800 
2 +17.9 185,000 
3 +18.5 201,600 
4 +16.2 139,500 
5 +15.2 127,900 
6 +15. 126,800 
7 +12.9 88,200 
8 +118 75,000 
9 +114 67,000 

10 (largest) +8.2 41,000 
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Table II 
Rates of Return and Their Risk (1926-1985) 
Number of 
Series Geometric Standard Losing 
Mean Deviation Years out of 
60 Years 
Small stocks 12.6% 36.0% 19 
Common stocks 9.8 21.2 19 
Long-term. 48 8.3 14 
corporate bonds 
Long-term govern- 4.1 8.2 16 
ment bonds 
U.S. Treasury bills 3.4 3.4 1 
Inflation 3.1 49 9 


Source: Ibbotson Associates. 


Keim (1986) and Ziemba (1986) for surveys of aca- 
demic research, and Lakonishok and Smidt (1987) for 
a 90-year look at the supposed anomalies. Rolf Banz 
(1981) and Marc Reinganum (1981), wrote the pi- 
oneering papers on the small-firm effect. Using a 
capital asset pricing model, they argued that even 
when the outcomes are adjusted for risk, small stock 
returns are higher. Brown, Kleidon and Marsh (1983) 
showed that the excess returns were linear in the log 
of size. The central question of interest to financial 
economists is whether the small-firm effect is a true 
anomaly or whether the extra gains are simply pay- 
ment for added risk once it is properly measured, 
since small firms tend to have higher market risk betas. 
Papers by Roll (1981) and Blume and Stambaugh 
(1983) have explored this question. Donald Keim 
(1983) made the important discovery that a large 
portion of the excess returns of small over large stocks 
occurs in January. He estimated that nearly 50% of 
the excess returns occur in this single month, of which 
26% are in the first week. Keim’s study suffers from 
the bid-asked spread bias which Blume and Stam- 
baugh correct for. Moreover, Blume and Stambaugh 
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Figure 2. Small stock portfolio minus S&P 500 
index (1926-1983). (Source: Ibbotson Associates.) 


argue that all of the size effect is in January, i.e., that 
the size effect is about 0.60% per day in January and 
zero the rest of the year. Tinic and West (1984), using 
data from 1935-1982 not controlled by size, showed 
that the positive relationship between risk and return 
is unique to January. The risk premiums in the re- 
maining months are not significantly different from 
zero. In further work on the same data, Ritter and 
Chopra (1987) found that the positive relationship 
exists only with small stocks. In essence, the pricing 
model is valid for small stocks only in January. De- 
spite these attempts to explain the small-firm effect 
with capital asset pricing and arbitrage pricing model 
tests, the academic world has not been able to find a 
defensible risk measure that brings the small-firm 
excess returns in January back into line. Gultekin and 
Gultekin (1983) show that the small firm effect occurs 
all over the world. 

In Figure 2, the Ibbotson data show that a small 
stock portfolio for the years 1926-1983 returned 
nearly 6% (arithmetic averages) more in January than 
the S&P 500 index. 


Table ILI 
Mean Return Differences between the 
Equally and Value-Weighted Indices 


January Trading Day during 
1963-1978 


December First Second Third Fourth 


Mean return 0.5647 1.186 0.6067 0.6107 0.4527 
t Statistic {4.72) (8.39) (3.86) (3.96) (3.05) 


Last Day 
of 


Moreover, Richard Roll (1983b), using data from 
1983 to 1978, found that most of the January gains 
(37% of the total, or 3.45%) are made on 5 days: the 
last trading day of December (the —1 day) and the 
first 4 trading days in January (+1 to +4), and much 
of the balance on the next 4-6 trading days. In total, 
67% of the 9.31% average annual return differential 
(biased upward slightly because of the bid-asked 
spread) for 1963-1980 between equally weighted and 
value-weighted indices of NYSE and AMEX stocks is 
attributable to the first 20 calendar days of January 
plus the last trading day in December. The mean 
return differences between the equally and value- 
weighted indices by trading day for the turn of the 
year days are very high and are all significant 
(Table IID). 

Jay Ritter (1986) has calculated that the average 
difference in returns between the smallest and largest 
deciles of market values on the New York Stock 
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Table IV 
Seasonal Behavior of Weekly Stock Market 
Absolute Risk Premiums (1963-1979): Rm — Rr 


Day Value-Weighted Equal-Weighted 


Relative to Market Returns Market Returns 
End of 

Year Rm- Ry t-Ratio Rm-R;  t-Ratio 

Rest of year —0.000679 —0.75 0.000223 0.22 

-12 0.003594 0.75 —0.001366 —0.25 

-Il —0.001362 -0.29 —0.001989 -0.36 

-10 —0.001967 —0.41 —0.009102 —-1.67 

-9 0.003528 0.74 0.001211 —0.22 

-8 —0.000377 -0.08 -0.001968 —0.36 

-7 0.003027 0.63 -0.000384 —0.07 

—6 —0.003648 -0.76 —0.004073 —0.75 

-5 0.008022 1.68 0.010486 1.92 

-4 —0.001206 -0.25 —0.002594 -0.48 

-3 0.000425 0.09 —0.000749 —0.14 

-2 0.001957 0.41 —0.000339 —0.06 

-I 0.006834 1.43 0.007857 1.44 

1 0.009289 1.95 0.030053* 5.51 

2 0.002526 0.53 0.020034* 3.67 

3 0.007946 1.67 0.015631* 2.87 

4 —0.000901 —0.19 0.006264 1.15 

5 0.002637 0.55 0.008836 1.62 

6 -0.000147 -0.03 0.002608 0.48 

7 0.001778 0,37 0.005080 0.93 

8 0.000694 0.15 0.001705 0.31 

9 —0.003851 —0.81 —0.002509 —0.46 

10 0.009044 1.90 0.009873 1.81 

11 0.005101 1.07 0.005775 1.06 

12 —0.004289 —0.90 -0.002252 -0.41 

13 0.004722 0.99 0.004235 0.78 
R-squared= 2.3% 7.4% 

Durbin-Watson statistic = 1.91 1.39 


* Significant at 5%, using a two-tailed test. 


Exchange is 9.99% (and ranges from 3.0% to 41.5%) 
for the nine trading days (—1 to +8) during the 
14 years 1971-1984. Table IV from Smidt and Stewart 
(1984) shows that risk premiums are much higher for 
small stocks than for large stocks on days (—1 to +4) 
and that they continue to be higher on days (+5 to 
+12), The highest returns, consistent with the Roll 
data and with Lakonishok and Smidt’s (1987) 90-year 
study, are on days (+1 to +3). 


2. Analysis of the Evidence 


Academic research on the turn-of-the-year effect has 
focused on three questions: 


+ What causes this anomaly? 

e Can its excess returns be explained by increases in 
risk? 

+ Can an investor make excess profits net of transac- 
tions costs by buying small stocks at the turn of the 
year? 


We now describe the current state of the conclusions 
reached so far for each question (for more details, see 
the cited references). 


Probable Causes 


The causes of the turn-of-the-year effect seem to be: 
(a) tax loss selling, (b) renewed buying interest in small 
stocks in the new year because of the availability of 
excess cash balances that, because of the low trading 
volumes, cause upward price pressure, (c) high trans- 
actions costs that prevent these patterns from being 
arbitraged away, (d) portfolio manipulations, and (e) 
turn-of-the-month and quarter price rise effects. We 
will look at the main points in turn. 


Tax Loss Selling. Branch (1977), Dyl (1977), Rein- 
ganum (1983b), Givoly and Ovadia (1983), Rozeff 
and Kinney (1976), Rozeff (1985a, b) and Wachtel 
(1942) have shown that securities with price declines 
in the previous year have high returns in January. 
These returns become even larger as the previous 
year’s decline increases. Williams (1986), using a ra- 
tional expectations model, shows that securities with 
heavy tax loss sales in December have larger compet- 
itive risk-premia and thus higher expected returns in 
January. Constantinides (1984, 1986) argues that, on 
logical grounds, though tax selling seems to be in- 
volved with the tumm-of-the-year effect, it cannot be 
the sole cause. Chan (1985) and DeBondt and Thaler 
(1985) found that the excess returns in January of 
stocks sold in December may last for as many as five 
years. Ritter observes that a higher percentage of small 
stocks are held by noninstitutional investors, who have 
a far greater incentive than institutions do to sell stocks 
that have declined in price. Losses can be deducted 
only following sales. Gains and losses realized prior to 
the last five trading days of the year affect that year’s 
taxes. Gains and losses realized in the last five days 
may be allocated to either year. Hence, during these 
last five days, the tax selling of losers is reinforced by 
the selling of winners by investors deferring the tax 
liability on the profits. Ritter, using transaction data 
from Merrill Lynch from mid-December 1970 to mid- 
December 1984, showed that in these 14 turn-of-the- 
year periods, the buy/sell ratios of individual investors 
are low for small stocks on days (—10 to —2) and high 
on days (+1 to +8). Fully 60% of the excess return on 
small stocks at the turn of the year is explained by 
buy/sell ratio changes. 


Renewed Buying Interest in Small Stocks in Janu- 
ary. Individual investors have available funds from 
tax loss sales, and locked-in profits from sales in 
the last five days, that appear on year-end account 
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statements, Christmas bonuses and other year-end 
financial receipts. They use these funds to buy stocks 
they consider underpriced, a high percentage of which 
are small stocks. Blume and Stambaugh, and Stoll 
and Whaley, have shown that the small firms are 
essentially represented by low-priced securities. 
Indeed, there is a significant positive relationship 
between mean excess return and the inverse of price- 
per-share. A portfolio of the lowest price stocks dom- 
inates a portfolio of the highest priced stocks by about 
10% per year during the 25 years from 1955 to 1979. 
Ken McNeil, a Calgary broker with Wood Gundy, 
mentioned that the action in January in low-priced 
stocks occurs when they hit key values such as $2, $5 
and $10, At that point, investors jump on the bargain 
bandwagon. 


Transactions Costs. Stoll and Whaley (1983) show 
that for the lowest-priced stocks, the total transaction 
costs average 2.93% bid-ask spread and 1.92% two 
way transaction, or nearly 5% for a round trip trade. 
Hence, although it seems advantageous to buy small- 
firm or low-priced stocks at the turn of the year 
because of the quick initial gains, virtually all of the 
excess gains would be eaten up with the transaction 
costs of a quick sale. Ritter estimates the (—1 to +8) 
gain of small stocks over big stocks to be 9.99% over 
the years 1971 to 1984. In addition, the gain would 
not be a capital gain. Stoll and Whaley find that 
the break-even holding period at which the after- 
transaction costs’ abnormal return is zero is about 
4 months for the smallest stocks. Partly because of 
these large transaction costs, we do not seem to ob- 
serve large amounts of this short-term trading. On 
average, these stocks are held and not sold in this 
period. 


Portfolio Manipulations. Positions and bonuses are 
largely based on performance. Hence, in late Decem- 
ber, portfolio managers may attempt to improve per- 
formance artificially by bidding up stocks already in 
their portfolios. This and other reasons lead to a shift 
to sales at the asking, from the bid on day (—1). For 
more on this phenomenon, see Lakonishok and Smidt 
(1984), Roll (1983a, b) and Ritter. 


Turn of the Month and Quarter Price Rises. Ariel 
(1987a, b), using data from 1963 to 1981, has shown 
that stock prices, on average, rise in the first half of 
each month and are flat during the second half; see 
Figure 3. This idea dates at least to Merrill (1966). 
Fosback (1976) has studied it and has discussed and 
successfully tested it, using no-load mutual funds and 
stock index futures, in his newsletter, Market Logic; 
see, for example, the May 1, 1987 issue. See also 
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Figure 3. The turn-of-the-month effect: 1963-1981. 
(Source: Ariel 1987.) 
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Hirsch (1979, 1986). The major effect is again on days 
(—1 to +4), with the strongest effect occurring during 
January. The effect on the first month of the other 
three quarters is larger than the other eight months. 
Lakonishok and Smidt (1987) give an extensive view 
of the effect over 90 years of data on the high capital- 
ization Dow Jones Index. While they find the effect 
Ariel reports for recent periods, it does not occur over 
the entire period. They treat (—1) as part of the pre- 
vious, rather than the succeeding, month. The turn- 
of-the-month effect seems to be attributable to several 
factors: end-of-month portfolio adjustments by insti- 
tutions, investment of monthly stock purchase plan 
receipts by mutual funds, and monthly salary and 
other receipts by the investing public. 


Excess Returns 


Can the excess returns be explained by increases in 
risk? The jury is out on this crucial question. However, 
some good insights appear in Rogalski and Tinic 
(1986). They utilized data from 1963 to 1982 on all 
New York Stock Exchange and AMEX securities, 
developing 20 equally weighted portfolios based on 
size. Portfolio 1 has the smallest stocks and 20, the 
largest. They reached the following conclusions. 


+ The hypothesis that the mean return for portfolio 1 
in every month is equal is false, but it cannot be 
rejected once January is eliminated. 

There is a seasonality in risk, as the following data 
show. 

The £ coefficients of portfolios 1-5 are much larger 
in January than in any other month when measured 
using daily data and the CRSP equally weighted 
index as the market return (8, the covariance of the 
portfolio, with the market portfolio divided by its 
variance, is the measure of the portfolio’s risk, con- 
sistent with the capital asset pricing model). 

+ The higher 8’s of small stocks in January are not 


88 


Calendar Anomalies and Arbitrage 


804 / CLARK AND ZIEMBA 


solely attributable to thinner trading volumes during 
that month. 

« The January £ of the smallest firms is almost twice 
that of the largest firms. 

» The January variance of portfolio 1 is nearly four 
times that of portfolio 20. 

+ The precise results are sensitive to the precise data 
set used, but the conclusions are robust. 


Excess Profits 


Can an investor make excess profits, net of transac- 
tions costs, by buying small stocks at the end of the 
year? 

The transactions costs are about 5% each way for 
small stocks and the potential gains are in the 10- 
15% range. Hence, there may be a small amount of 
gain left after a two-way transaction, but the gain is 
likely to be small. 


3. Strategies 


The subject of profits brings us to the thrust of this 
paper. There is strong evidence, as Figure 2 shows, 
that small stocks outperform large stocks at the turn 
of the year. Yet transactions costs eat away most, if 
not all, of the potential gains. Such costs on index 
futures are a tenth or less of the corresponding basket 
of securities. Hence, a strategy that should be very 
profitable is to hold long positions in a small stock 
index and short positions in large stock indices. 
Stock index futures began trading in 1982 in the 
United States, and the number of different contracts 
available, and their volume, have increased steadily. 
The ideal way to play the turn-of-the-year effect is to 
be long on the smallest stocks and short on the largest 
stocks in liquid index contracts. Our experience is 
with the spread between the Value Line and S&P 
indices, called the VL/S&P spread. This strategy is 
not an ideal way to play the effect, but it is the best 
we have found so far, and it has been successful. The 
VL index is an equally weighted geometric average of 
the prices of nearly 1,700 securities with futures traded 
on the Kansas City Board of Trade and futures options 
traded on the Philadelphia Stock Exchange. IBM and 
the smallest company in the index are treated equally 
in the weighting. The VL index has a downward drift 
of about 5.5% per year relative to the component 
securities in the index because of geometric averaging, 
due to the geometric-arithmetic averaging inequality. 
The amount of the downward drift depends upon the 
variance of price movements of the component secu- 
rities. The bias is approximately equal to half the 
average unique risk; see Modest and Sunderasan 


(1983). The higher the unique variances, the higher 
the drift. For more on the mathematics of the VL 
index, see Eytan and Harpaz (1986). The S&P 500 
futures contract is traded on the Chicago Mercantile 
Exchange, and is value weighted. Hence, IBM and the 
other large stocks count much more than the medium- 
size stocks at the bottom of the index. Hence, in a 
crude fashion, the VL/S&P spread gives you the small 
stocks long and the large stocks short. The bigger the 
stocks, the shorter they are. However, all the small 
stocks and medium stocks are held in the same pro- 
portion. 

The VL contains few of the smallest decile stocks 
on the NYSE, but it is the best index that is traded for 
our purposes. All of the other index futures traded in 
the United States are either value weighted or com- 
prise only a small number of large capitalization se- 
curities, such as the Major Market Index. For both 
the S&P and VL futures, four contract months are 
traded: March, June, September and December. The 
contracts currently expire on the third Friday of the 
contract month. (Until December 1985 the VL ex- 
pired on the last trading day of the third month.) This 
day is referred to as the triple witching day since 
futures, options and futures on options all expire at 
that time. On occasion this day has had tremendous 
volatility because program traders must unwind their 
positions before expiry. Beginning with the June 1987 
S&P contract, new rules went into effect: trading now 
ends on Thursday’s close but settlement is based on 
Friday’s opening prices. Whether or not this cumber- 
some procedure will work remains to be seen. The VL 
contract’s expiry remains at Friday’s close. 

For maximum liquidity and the smallest bid-ask 
spreads, especially since the VL/S&P spread trade 
must be made over two exchanges, it is best to use the 
March contract. Also, since the trade should be com- 
pleted at the end of January at the latest, there is no 
need to consider the June contract. The S&P contract 
is very liquid and trades about 75,000 contracts a day, 
but the VL is much less liquid, trading about 3,000 
contracts a day. Each spread entails two commissions 
{possibly computed as only 1.5 times a single futures 
contract commission), payable on sale, which cost 
from as much as $200 at a full-service broker down 
to $50 or less at a discount broker, Each contract 
is worth $500 times the index value. Hence, each 
point difference gains or loses the trader $500. Thus, 
0.05-0.40 is needed to cover the transactions costs. 
The margin requirements are about $1,100 per spread 
and are subject to change by the exchanges. (These 
margins will be much larger for the 1987/88 turn of 
the year because of the effects of the October 19, 1987 
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stock market crash.) Typically an investor keeps one 
or more $10,000 T-bills for this security plus a separate 
interest-bearing account to cover offside positions. 
The 1 — | weighting seems to minimize the risk of 
the overall position, since the standard deviation of 
the S&P and VL index futures are about 1.82% and 
1.74% per day with a daily correlation of 0.967, 
according to Donaldson, Lufkin and Jenrette’s Fu- 
tures Service. If the standard deviations and correla- 
tions change over time, the optimal spread may not 
bel —-1. 

When should an investor trade? We have found that 
the following rule works well: buy the spread on the 
first closing uptick, starting on December 15 and 
definitely by the 17th, and sell on January 15. Waiting 
until (—1) now seems to be too late: possibly the 
number of finance professors and their colleagues, as 
well as other students of the turn-of-the-year/January 
effect who are in on the strategy, move the VL index. 
There seems to be a bidding up of the March VL 
future price relative to the spot price. Table V shows 
the results from the ten years 1977-1986. By January 
15, the biggest gains are over and the risks increase. 
On average, the spread drops 0.92 points in this 
period, with a high variance. The projected gain from 
a successful trade is 0-5 points and averages 2.85 
points or $1,342.50 per spread, assuming a commis- 
sion of 1.5 x $55. On average, the December 15 to 
(—1) day gain on the spread is 0.57 points. However, 
it was 1.05 in 1985 and 3.15 in 1986, which may 
reflect the fact that with the thin trading in the VL 
index, the market can be moved with a reasonably 


small number of players who are learning about the 
success of this trade, i.e., the basis was bid up antici- 
pating the January move. We made this trade in 
1984/85, 1985/86 and 1986/87. The closing uptick 
rule sharpens the edge slightly. For example, in 
1985/86, the spread closed at 2.30 on December 17 
and we were able to buy it as low as 1.65 and 1.40, 
versus the 2.80 on December 15. The differential of 
the cash spread to the future spread has been of great 
assistance. As in most speculative markets, prices 
change after the market has exhausted the staying 
power of those players who make ill-timed decisions, 
and the cash/futures spread when employed as an 
oscillator (i.e., overbought/oversold) assists us in mar- 
ket entry timing around our time windows of Decem- 
ber 15-17 and January 15. Figures 4 and 5 graphically 
display the VL/S&P spread during December, for 
these ten years, One additional rule that would have 
added to profits in 1977/78, 1979/80, 1981/82, 
1982/83 and 1983/84 is to double up the position if 
the December 15 to (—1) period results in a loss for 
the trade. Table VI demonstrates the results of this 
modification: The profits from the trade triple from 
$993.80 to $2,724. 


5. The 1986/87 Play 


For our 1986/87 play we attempted to optimize our 
investment. The data in Table V yields an estimated 
mean return of +2.85 per contract. Dennis Capozza, 
a colleague at the University of British Columbia, 
estimated that the average standard deviation of the 


Table V 
Results from VL/S&P Spread on Various Buy/Sell Dates for the Ten Turn-of-the-Years (1977-1986)% 
Turn of the Spread Spread Spread Spread Difference Difference Difference < “Trade Gato Net Profit 
Year Dec.15 (1) Jan. 18 EndJan, «= Dees 15 (1) to Jan. Sto Dec. 150 Sn trade? 
i ij ü to(—1) Jan. 15 End Jan, Jan. 15 
Spot Prices 

1976/77 —14.74 -14.07 -10.92 —9.02 +0.67 +3.15 -1.90 +2.94 $1,387.50 
1977/78 -1.17 2.42 0.67 1.24 -1.25 +3.09 +0.57 +0.98 $407.50 
1978/79 0.79 4.53 3.44 5.40 +3.74 —=1.09 +1.96 uE pA $477.50 
1979/80 12.00 10.87 15.82 15.18 -1.13 4.95 0.64 +3.82 $1,827.50 
1980/81 9.16 11.77 11.22 11.24 +2.61 -0.55 +0.02 +2.06 $947.50 
1981/82 14.78 13.91 14.90 11.86 —0.87 +0.99 —3.04 +0.12 —$22.50 
1982/83 18.03 17.06 22.61 20.10 -0.97 +5.55 —2.51 +4.58 $2,207.50 

March Futures Prices 
1983/84 30.85 29.40 32.10 27.70 —1.43 +2,69 —4.40 +1.26 $547.50 
1984/85 11.80 12.85 16.60 19.50 +1.05 +3.75 +2.90 +4.80 $2,317.50 
1985/86 2.80 5.95 6.20 4.02 +3.15 +0.25 -2.18 +3.40 $1,617.50 
Average +0.57 +2.28 —0.92 +2.85 $1,342.50 


* The years 1974/75 and 1975/76 had extremely strong effects as well. 
t With 1.5 x $55. = $82.50 transactions cost. Lower transactions costs, in the $40-$50 range, are possible with discount brokers. 
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Figure 4(a-e). VL/S&P spread in December and 


January for 1976/77 to 1980/81. 


VL/S&P spread in 1986 was 3.00. This assessment 
yields the following approximate return distribution 
for the trade. 


Probability 
0.007 
0.024 
0.070 
0.146 
0.217 
0.229 
0.171 
0.091 
0.045 


Gain 


KF OK KNWHUD ~) 


For repeated investments over time, the capital 
growth criterion that involves the maximization of the 
expected logarithm of returns has many desirable 
properties: it maximizes the asymptotic rate of growth 
of the investor’s fortune; it minimizes the expected 
time to reach a preassigned (sufficiently large) goal; a 
period-by-period myopic optimization policy is opti- 
mal; it allows the investor to invest more as the 
situation becomes more favorable; it can take into 
account possible effects of the investor's purchases on 
the return distributions and thus allow for simple 
operational decision rules for actual applications; and 
it has the never-risk-ruin property of logarithmic util- 
ity. See e.g., Kelly (1956), Brieman (1961), Thorp 
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Figure 5(a-e). VL/S&P spread in December and 
January for 1981/82 to 1985/86. 


(1975), and Hakansson (1971, 1979). Ziemba and 
Hausch (1986, 1987) give a practical discussion of 
these concepts in layman’s terms, along with simula- 
tions. 

In practice, the capital growth, or Kelly criterion, as 
it is known in the speculative investment literature, 
tends to suggest extremely high wagers when the in- 
vestment situation is very favorable. Still, on balance, 
it seems to be the most desirable investment strategy 
concept for repeated speculative situations involving 
a small number of asset choices. 

For the turn-of-the-year play with a mean return of 
2.85 and standard deviation of 3.00, the optimal wager 
is a staggering 74% of one’s fortune! Indeed, in the 


long run, according to the theory, with such a strategy, 
the investor will, with probability approaching one, 
accumulate more money than investors with essen- 
tially different strategies. Still, with the uncertainty 
involved, lower wagers are suggested, especially if the 
distribution just described is overoptimistic. After all, 
the data in Table V have a sample size of only ten. 
Moreover, there are fluctuations, margin calls and the 
like to consider. MacLean, Ziemba and Blazenko 
(1987) have developed methods to generate a com- 
plete trade-off of risk versus return in such dynamic 
investment situations, using “fractional Kelly strate- 
gies.” The lower the fraction of the “optimal Kelly 
wager” invested, the higher is the security and vice 
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Table VI 
Results from VL/S&P Spread with the Double-Up Modification 
Two 
a A Net Profit 
Difference Times : 
Turn of the Year Tr Dierenees Trade Gain Net Front poets 
to) CD to Modification 
Jan. 15 
Spot Prices 
1977/78 1.25 6.18 4.93 $2,300 $407.50 
1979/80 —1.13 9.90 8.77 $4,220 $1,827.50 
1981/82 —0.87 1.98 Lil $390 —$22.50 
1982/83 —0.97 11.10 10.13 $4,900 $2,207.50 
March Futures Prices 
1983/84 =1.43 5.38 3.95 $1,810 $547.50 
Average =1.13 6.91 5.78 $2,724 $993.80 


versa. Figure 6 displays the probability of doubling, 
tripling and tenfolding one’s fortune before losing half 
of it, as well as the growth rate, for various fractional 
Kelly strategies. At fractional strategies of 25% or less, 
the probability of tenfolding one’s fortune before halv- 
ing it exceeds 90%, with a growth rate in excess of 
50% of the maximal growth rate. Figure 7 gives the 
probability of reaching the distant goal of $10 million 
before being ruined for Kelly, half-Kelly and quarter- 
Kelly strategies with wealth levels in the range of 
$0-$10 million. The results indicate that the quarter- 
Kelly strategy seems very safe with a 99+% chance of 
achieving this goal. 
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Figure 6. Relative growth rate and the probability 
of doubling, tripling and tenfolding before halving 
for various fractional Kelly strategies. 


We employed the concepts in a $100,000 specula- 
tive account for a client of CARI Ltd., a Canadian 
investment management company. We decided to 
purchase five VL/S&P spreads to approximate a 
slightly less than 25% fractional Kelly strategy. Watch- 
ing the market carefully, we bought these on Decem- 
ber 17, 1986 at a spread of —22.18, which was very 
close to the minimum price of the spread. On Decem- 
ber 15, the spread closed at —20.90, and on the 16th 
at —22. It increased in value to —18.15 at the end of 
the year (in a flat and declining stock market) for a 
gain of 3.85. In January the stock market took off in 
an impressive style, with ten consecutive up days. The 
spread continued to gain, and we cashed out at —16.47 
on January 14 for a total gain of 5.55 points per 
contract, or a total gain of $14,278.50 after transac- 
tions costs. Figure 8 displays the spread during Decem- 
ber 1986 and January 1987. The spread began to drop 
sharply on the 15th and the drop escalated into a rout 
of the small stocks in comparison with the big stocks 
in the S&P 500. At the end of January the index stood 
at —31.45. Our experience demonstrates that the trade 
must be handled carefully and that, more or less, the 
December 15th to January 15th period is the best time 
to trade. More and more players are moving the 
market in the December 15-31 period, and as in 
1985/86, most of the gains occurred during that 
period. 


6. Conclusions 


The turn-of-the-year effect offers many interesting 
avenues for exploration. We will decribe a few. 


+ The effect might be played using options, since the 
variances and $’s of small stocks are much larger 
than large stocks in January. 
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Figure 7. The probability of reaching $10 million 
before ruin for Kelly, half-Kelly and quarter-Kelly 
strategies. 


+ A play might be devised using the Toronto Stock 
Exchange (TSE) 300 index, since Tinic and Barone- 
Adesi (1986) found that excess returns in January 
are nearly 4% and returns are negative in the other 
months. See also Berges, McConnell and Schlar- 
baum (1984) and Tinic, Barone-Adesi and West 


. 


. 


(1988). In in addition, the new Canadian capital 
gains tax revisions of June 1987 may give added 
pressure to the TSE in December 1987 and 1988 
and thus improve the chances for and the extent of 
these two turn-of-the-year plays. 

The Japanese stock market is another possible area 
for investigation; Kato and Schallheim (1985) give 
evidence for a strong January effect over the period 
1952-1980. 

Another strategy might involve U.S. stock indices 
such as the NYSE and Major Market, and the 
discrepancies between spot and futures prices: see, 
for example, Cornell (1985), Cornell and French 
(1983a, b), Modest and Sundaresan (1983) and 
Figlewski (1983, 1985). 

A study might be undertaken to determine the effect 
of the new United States tax laws on the turn-of- 
the-year effect. 


* Better weighting and investing strategies might be 


devised using dynamic asset allocation strategies. 


e Proper tests and ways to devise data sets need to be 


developed to evaluate the real strengths and risks of 
these effects. 

Finally, some basic questions remain unanswered. 
How much of this “data snooping,” as Lakonishok 
and Smidt (1987) call it, is real? Is the small first 
effect a measured relationship or a real anomaly, 
and how much learning is going on to move the 
market toward efficiency? On this point, see also 
Merton (1986) and Black (1986). 
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Figure 8. VL/S&P spread in December 1986 and January 1987. 
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Arbitrage Strategies for 
Cross-Track Betting on Major 
Horse Races* 


I. Introduction 


Racetracks and securities markets have many 
characteristics in common. A difference, though, 
is their complexity; the racetrack is really a se- 
quence of markets that are relatively simple, 
short-lived, and, for the most part, independent. 
This ‘‘market-in-miniature’’ feature makes the 
racetrack attractive for tests of market effi- 
ciency, especially since, as Thaler and Ziemba 
(1988, p. 162) suggest, ‘‘one can argue that wa- 
gering markets have a better chance [than securi- 
ties markets] of being efficient because the condi- 
tions (quick, repeated feedback) are those which 
usually facilitate learning.” The many empirical 
racetrack studies support a weak form of ef- 
ficiency for some of the available wagers, while 
other types of wagers seem not to be efficient. 
These studies are reviewed in Section II. 

This article studies cross-track betting, a rela- 
tively new form of wagering. It allows bettors to 
wager at their track (a cross track) on a race 


* Without implicating them, we would like to thank Bruce 
Fauman and Fraser Rawlinson. Also, we greatly appreciate 
the data supplied by a number of U.S. racetracks, and we 
wish to thank Victor Lespinasse for suggesting the one-track 
model. 
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Cross-track betting per- 
mits bettors to place 
wagers at their local 
tracks on a race being 
run at another track. 
Since each track oper- 
ates a separate betting 
pool, the odds can vary 
across the tracks. The 
data suggest that the 
odds vary, and they of- 
ten vary dramatically, 
allowing arbitrage op- 
portunities. This article 
employs a risk-free ar- 
bitrage model to dem- 
onstrate the cross-track 
inefficiency and recom- 
mends an optimal capi- 
tal growth model for 
exploiting it. A simpler 
method is proposed for 
a single bettor at a sin- 
gle cross track. The re- 
sults indicate that these 
methods would have 
worked well in practice 
on a number of recent 
Triple Crown races. 
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TABLE 1 Home-Track and Cross-Track Betting, Kentucky Derby 

Home-Track No. Cross Home-Track Cross-Track 
Year Attendance Tracks Betting ($) Betting ($) 
1982 141,009 ihe 5,011,575 
1983 134,444 hes 5,546,977 Tae 
1984 126,453 24 5,420,787 13,521,146 
1985 108,573 32 5,770,074 14,474,555 
1986 123,819 56 6,165,119 19,776,332 
1987 130,532 73 6,362,673 20,829,236 
1988 137,694 93 7,427,389 24,449,058 


being run at another track (the home track). Since cross-track betting 
tends to be limited to major races, it gives the racing public an opportu- 
nity to bet on some of the world’s finest racehorses. This makes it very 
popular with the public. Cross-track wagering can lead to increased 
attendance and revenues at the cross tracks and add to the revenues of 
the home track through a fee (usually 5% of the handle) paid by the 
cross tracks. Thus, all the tracks can increase profits. ' 

Separate pools for each track means the payoffs at the various tracks 
can differ.? Due to the costs of arbitrage in this setting, market 
efficiency across the tracks would, for practical purposes, allow some 
differences across the various sets of track odds. Considerable differ- 
ences, however, would suggest the possibility of a market inefficiency. 
The data demonstrate that considerable differences do occur. For ex- 
ample, a $2.00 win ticket on Ferdinand, the winner of the 1986 Ken- 
tucky Derby, paid from $13.20 at Fairplex in Pomona, California, to 
$90.00 at Evangeline Downs in Lafayette, Louisiana.’ Obviously, bet- 
tors would have preferred their win bets on Ferdinand to be made at 


1. For an example of this effect, consider the home-track and cross-track betting on 
the Kentucky Derby (see table 1). The introduction in 1984 of cross-track betting on this 
race has greatly increased total Derby wagering. At the same time, cross-track betting 
seems to have had, at worst, only a minor effect on home-track wagering. The cross 
tracks’ revenues can increase also. For instance, Illinois set a one-day pari-mutuel 
record on Kentucky Derby Day, 1987. Cross-track betting on the Derby accounted for 
$1,326,239 of the $4,534,879 wagered that day in the state. Another example is Calder 
Race Course in Florida. They set all-time revenue and attendance records on Kentucky 
Derby Day, 1985. Attendance was 23,105, and $2,775,645 was wagered, $562,453 of it on 
the Derby. 

2. In some cases, all the wagers at the various tracks are summed. Then, on the basis 
of these summed values, identical payoffs are made at all the tracks. This is often called 
‘‘intertrack’’ wagering and typically the tracks are within one state. Intertrack wagering 
will not be considered here. 

3. These extreme payoffs are not just limited to the smaller tracks. Two large-track 
examples are Hollywood Park, where Ferdinand paid $16.80, and Woodbine Racetrack 
in Toronto where he paid $79.60. Alysheba, the 1987 Kentucky Derby winner, paid from 
$15.80 at Hollywood Park, California to $30.20 at Beulah Park, Ohio. Winning Colors, 
the 1988 derby winner, paid $7.40 at Pimlico Race Course, Maryland, and $10.40 at 
Beulah Park. 
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Evangeline Downs. The nature of the pari-mutuel betting system re- 
quires that if Ferdinand paid less at Fairplex than at Evangeline 
Downs, then another horse, were it to have won, would have paid 
more at Fairplex than at Evangeline Downs. Thus, if we are able both 
to learn the odds and place our bets at various tracks, it appears that 
significant arbitrage opportunities may exist. 

Section III develops a risk-free arbitrage model to demonstrate this 
cross-track inefficiency. The optimal capital growth model is studied in 
Section IV on the general cross-track problem and, in Section V, ona 
simpler one-track problem. These models are tested on data from sev- 
eral recent Triple Crown races. A final discussion is in Section VI. 


II. Efficiency of the Various Betting Markets 


Among the possible wagers at the track are the so-called straight wa- 
gers to win, place, and show. They pay off when one’s horse is at least 
first, second, or third, respectively. The ‘‘exotic’’ wagers include 
quinellas (requiring one to name the first two horses), exactors (requir- 
ing the first two horses in the correct order), trifectas (requiring the first 
three horses in the correct order), and daily doubles (requiring the 
winners of two consecutive races). Tracks have also extended the 
daily-double concept to picking the winners of three, four, six, and 
even nine consecutive races. These are very low-probability bets that 
can have tremendous payoffs, and they are very popular with the rac- 
ing public. Before exotic wagering was offered by the tracks, bettors 
could use parlays and other combinations of wagers to construct low- 
probability/high-payoff situations. Rosett (1965) analyzed these pos- 
sibilities and demonstrated that, except for extreme long shots, the 
bettors were rational in the sense that a simple bet would not be made if 
a parlay with the same probability of success had a greater return. 
Similarly, Ali (1973) showed that the return on a daily double is not 
significantly different from the return on what is an identical wager, the 
corresponding parlay of win bets. 

Unlike typical casino games, where the odds are fixed, the odds at 
the track are determined by the relative amounts the bettors wager on 
the horses and by the track’s transactions costs (the track’s take and 
breakage). Thus, ‘‘prices’’ are determined at the track much like they 
are in securities markets. Let 


n = the number of horses in a race; 
T = the number of tracks accepting wagers on the race; 
W; = the total amount bet by the public at track ¢ on horse i to 
win(@i=1,...,nand/=1,...,7); 
W' = 3, Wi = the win pool at track t; and 
Q' = track čs payback proportion (typically from .80 to .86). 


ll 
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Then the payoff per dollar bet to win on horse i at track ¢ is 
O'W'/W: if horse i wins, 0) 
l 0 otherwise. 


To determine the place payoff let P} be the amount bet to place on 
horse i at track ż¢ and let P' = £;P; be track t’s place pool. The payoff 
per dollar bet to place on horse i at track ¢ is 


(2) 


1 + (Q'P' — Pi — P)/(2P;) if the first two horses are i and j, 
l 0 if horse i is not first or second. 

Thus, the track keeps (1 — Q')P' and the place bets on i and j are 
repaid. The remainder, the losing bets minus the track take, is then 
split evenly between those who bet on i and those who bet on j. The 
share for i bettors is then divided on a per-dollar-bet basis. The place 
payoff on i does not depend on whether i was first or second but it does 
depend on which horse j was the other top finisher. In a similar fashion 
the payoff per dollar bet to show on horse i at track ¢ is 


1 + (Q'S' — Si — Si — S})/@BS!) if the first three horses are i, j, 
and k, (3) 
0 if horse i is not at least third. 


Here Si is the show bet on horse i by the public at track 7, and S’ = }$;S; 
is track t’s show pool. It is possible that (Q’S’ — S} — S; — Si)/3S}) is 
less than 0.05, or even negative. In these cases, called minus pools, the 
track usually agrees to pay $0.05 profit for each dollar wagered. Minus 
pools can also occur in the win and place markets, but are much less 
common. Equations (1)—(3) ignore breakage, the additional charge that 
results from the track rounding all payoffs down to the nearest 5 or 10 
cents cn the dollar.* 

If prices reflect all available information then a market is said to be 
efficient (see Fama 1970). There are two conclusions that can be drawn 
from the many studies of win market efficiency (see, for instance, Ali 
[1977]; and Snyder [1978]). First, the North American public underbets 
favorites and overbets longshots, and this bias appears across the 
many years that data have been collected and across all sizes of race- 
track betting pools.’ Second, despite its strength and stability, this bias 
is almost always less than the track take and thus it cannot be exploited 
to achieve positive profits. Figure 1 illustrates the favorite/longshot 


4. Breakage may seem like a relatively minor cost, but we (Ziemba and Hausch 1987) 
demonstrate that it can have a dramatic long-run effect on a bettor’s fortune. 

5. Busche and Hall (1988) demonstrate an opposite bias for Hong Kong bettors. Con- 
trary to the North American bettors, Hong Kong bettors tend to overbet favorites and 
underbet long shots—a bias that is consistent with risk aversion. 
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Fic. 1.—Expected return per dollar bet versus odds level: aggregation of 
studies involving more than 50,000 races. (Source. —Ziemba and Hausch 1986.) 


bias and shows that the track take (assumed to be 15.33%, as it is in 
California) is sufficiently large to preclude profits. The one profitable 
exception is extreme favorites at odds of 3-10 or less. However, they 
are relatively rare. Thus, if we define a market to be weakly efficient 
(see Fama 1970), if no one can devise a profitable trading rule based on 
historical price information, the win market is, for practical purposes, 
weakly efficient. 

Hausch, Ziemba, and Rubinstein (1981) tested the efficiency of the 
place-and-show markets. They assumed: 

ASSUMPTION 1. If q; is the probability that horse i wins, then the 
probability that jis first and j is second is q,q,/(1 — q;), and the probabil- 
ity that i is first, j is second, and & is third is 


aga — gM - qi — gi. 
(These formulas were developed and tested by Harville [1973].)® 


6. Henery (1981) and Stern (1987) show that these equations can be derived by as- 
sociating with each horse an independent exponential random variable with a scale 
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ASSUMPTION 2. The win market is efficient, so the win odds can be 
used to estimate q;. 

Using these two assumptions and equations (2) and (3), Hausch, 
Ziemba, and Rubinstein (1981) were able to identify horses that were 
underbet to place or show. The optimal capital growth model then 
determined the place-and-show wagers that maximized the expected 
rate of growth of one’s bankroll. Since the exact model is a com- 
plicated nonlinear optimization problem that is difficult to solve at the 
track, Hausch, Ziemba, and Rubinstein (1981) developed simple re- 
gression approximations with quite minimal data-entry requirements. 
Their empirical studies on two seasons of racing data indicated that 
significant returns on the order of 11% were possible in the place-and- 
show markets. We (Hausch and Ziemba 1985; Ziemba and Hausch 
1987) extended Hausch, Ziemba, and Rubinstein’s (1981) results to 
provide further evidence of the place-and-show inefficiency. We 
(Ziemba and Hausch 1986) and Asch and Quandt (1987) studied ineffi- 
ciency of the exotic markets. Asch, Malkiel, and Quandt (1984, 1986) 
investigated whether a drop in the odds late in the betting period might 
reflect inside information and thereby point to wagers that may have 
positive expected returns. Their results suggest that this is not the 
case, however. A more thorough literature survey is in Thaler and 
Ziemba (1988) and Hausch and Ziemba (1990). The latter also studies 
racing outside of North America. 


II. Inefficiency of the Win Market and the Risk-free Hedging Model 


The literature has demonstrated weak efficiency of the win market at a 
single track, despite a favorite/longshot bias. To test whether this weak 
efficiency is maintained across the win markets with cross-track bet- 
ting, data were collected on several recent Triple Crown races. Al- 
though cross-track betting is becoming more popular, it tends to be 
testricted to major races. The best known of these are the Triple 
Crown races: the Kentucky Derby at Churchill Downs on the first 
Saturday in May, the Preakness Stakes at Pimlico Race Course 2 
weeks later, and the Belmont Stakes at Belmont Park 3 weeks after 
that. 

A simple problem is considered first. Allowing only win betting, 
what is the minimum amount that our bettor must wager to ensure the 
return of $1.00 regardless of which horse wins the race? The solution to 
this problem is, for each horse, to identify the track that has it at the 
longest odds and bet just enough to receive $1.00 if it wins. The solu- 
tion involves no estimation of the horses’ win probabilities. If this 


parameter equal to the inverse of its win probability. Then, any ordering of the random 
variables is just the Harville formulas. Stern also develops alternative ordering formulas 
using gamma distributions that are more accurate but more complicated. 
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TABLE 2 Projected Win Payoffs, 1983 Preakness 
$ Amount of 
Highest Win Return Wager That 
Horse No. (on a $1 Bet) Track Will Return $1 
1 29.40 Louisiana Downs .0340 
2 12.70 Louisiana Downs .0787 
3 34.60 Los Alamitos 0289 
4 169.90 Hollywood 0059 
5 56.90 Louisiana Downs .0176 
6 5.70 Louisiana Downs 1754 
7 10.60 Pimlico 0943 
8 76.60 Louisiana Downs 0131 
9 116.10 Hollywood .0086 
10 2.20 Los Alamitos 4545 
11 40.60 Los Alamitos .0246 
Total 9356 


solution were employed at one track, then our bettor would have to pay 
1/Q dollars (an amount greater than $1.00) to ensure a return of $1.00. 
With the opportunity of betting at several tracks, each with a different 
set of odds, we may be able to lower this minimum amount to below 
$1.00. To see how this system works, consider the 1983 Preakness. The 
final win odds were collected from 11 tracks that allowed wagering on 
this race.’ The highest of the 11 win payoffs on horse number 1 was 
$29.40 per dollar wagered at Louisiana Downs. Thus, a win bet of 
$0.0340 there would have returned $1.00 when, in fact, horse number 
1, Deputed Testamony, won the race. This wager and those on the 
other horses are presented in table 2. 

Thus, by wagering $0.9356, our bettor is guaranteed $1.00 regardless 
of who wins the race. This is a certain profit of $0.0644 per $0.9356 
wagered, or a guaranteed return of 6.9% rate of return in a 2-minute 
race, 

Obviously, ‘‘risk-free’’ arbitrage is not possible. Implementing the 
system requires that the track odds be sent to a central decision maker 
several minutes before the end of betting, and during that time the odds 
can change. The system does, however, demonstrate the large dis- 
crepancies in betting across the tracks and shows how simple it can be 
to take advantage of them. Table 3 presents the results of applying the 
system to the data from other Triple Crown races. 

Three of the races had insufficient variance in the win odds across 


7. The number of outlets involved in cross-track betting has increased over the years. 
The 1985 Kentucky Derby had betting at 32 outlets, including New York City’s off-track 
betting (OTB). The 1985 Preakness and Belmont had 28 and 41 outlets, respectively. For 
1988, the number of outlets for these three races were, respectively, 93, 87, and 76. We 
requested the final win, place, and show data from the outlets that we knew had accepted 
wagers. Response rates tended to be low and, in some cases, the data were no longer 
available, having been stored for only a short period. All the data received were used in 
the analysis. 
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TABLE 3 Risk-free Hedging Model Results 
No. Horses No. 
Race in Race Tracks % Profit 
Preakness: 
1982 7 5 0 
1983 11 11 6.9 
1984 10 4 .0 
1985 11 7 2.5 
Belmont: 
1982 11 5 13.6 
1983 12 9 8.5 
1984 It 2 0 
1985 9 i 5.0 
Kentucky Derby: 
1984* 12 7 JA 
1985 12 6 10.1 
Average 4.7 


* Cross-track betting on the Kentucky Derby did not begin until 
1984. 


the tracks to allow a risk-free profit. This is not surprising for the 1984 
Belmont because we had data from only two tracks. Also, the results 
for the 1982 and 1984 Preakness are based on only five and four tracks, 
respectively. In these three races our bettor would obviously make no 
wagers. These three races giving a 0% return together with the other 
seven races have an average risk-free profit of 4.7%. This rate of profit 
is for the certain return of $1.00. As the required certain return in- 
creases, then the wagers begin to affect the odds and this rate of profit 
will decrease.® 


8. This ‘‘certain return’’ scheme can be extended to include place-and-show betting. 
Let Ri be the return on a $1.00 win bet on horse i at track t when i wins. Similarly, let Rj 
be the return on a $1.00 place bet on i at track ¢ when i and j are the top two finishers, and 
let Ri, be the return on a $1.00 show bet on i at track ¢ when i, j, and k are the top three 
finishers. Formulas.(1)-(3), corrected for breakage, determine Ri, Ri; and Ri. The deci- 
sion variables are amounts to wager to win, place, and show on horse i at track t and are 
represented, respectively, by w!, pf, and s/. There is a constraint for each ijk finish that 
requires a return of $1.00, should that finish occur. The following formulation ignores our 
effect on the odds and, while this effect is negligible for a return of $1.00, it should be 
included for larger required returns. The formulation that determines the minimum ex- 
penditure for a certain return of at least $1.00 is 


T n 
minimize > ` (wi + pi + si). 
t=1 i=i 
subject to 


T 
Ñ (Riwi + Rip! + Ripi + Rias! + Rias! + Risi) =1 for each i j, k, 


t=1 
wi, pi, si =0 foralli=1,...,nand¢=1,...,T7. 
This linear program has a large number of constraints; with n horses there are 
n(n—1)(n—2) possible ijk finishes and, consequently, n(n—1)(n—2) constraints. (The 


dual program has 37n constraints, however.) Interestingly, even with the possibility of 
place-and-show betting, betting only to win, as in Sec. III, may be optimal. The 1982 
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IV. The Optimal Capital Growth Model 


Section 11] demonstrated an inefficiency in cross-track betting. We 
now propose and test an “‘‘optimal’’ wagering strategy for this 
inefficiency. Bettors at the racetrack appear to have many different 
objectives and, therefore, employ many different wagering strategies. 
Two quite reasonable long-term objectives, however, are (1) to max- 
imize the expected rate of growth of one’s bankroll, and (2) to minimize 
the expected time to reach some specified large wealth level. Breiman 
(1961) proved that both of these objectives are asymptotically satisfied 
by maximizing, on a myopic period-by-period basis, the expected log- 
arithm of one’s final wealth. This approach is termed the optimal capi- 
tal growth model, and its theoretical justification for the logarithmic 
utility function has been well studied (see Ziemba and Vickson [1975] 
for references and a discussion of its assumptions and results). A simu- 
lation by us (Ziemba and Hausch 1986) suggests that this strategy not 
only performs well asymptotically but over a year of wagering it can 
also be expected to outperform other commonly used betting strate- 
gies. Other attractive features of the capital growth model are that 
one’s effect on the odds can be accounted for and bet size is monotone 
in wealth. On the negative side, though, the recommended bets can be 
very large. Indeed, the Arrow-Pratt absolute risk-aversion index is, 
wealth” ', which is close to zero for large wealth. 

Let wo be our bettor’s initial wealth and q; the probability that i is 
first, j is second, and k is third. Also, let Pj; = P; + Pjand Si, = Sj + $ 
+ S4. The optimal capital growth model is? 


n n 


n 
max > i? 5 dijk log | Wo 


t t ot j= j= = 
Wr S i=1 j=) k=l 
{ Pe a ji kžij 


[or F > vt) = (PE + pi + pi) 
— a 


Preakness (with 7 x 6 x 5 = 210 constraints) is one example of this. The reason for no 
place-or-show betting at optimum seems to be a coordination problem. Win bets return 
one and only one positive amount and these payoffs are mutually exclusive. Thus, a 
return of $1.00 can be efficiently guaranteed with win bets. Place-and-show bets, how- 
ever, tend to have many different possible payoffs, and collecting on them is not mutu- 
ally exclusive. These differences make it difficult to efficiently choose place-and-show 
bets over the win bets. 

9. It is very simple to include exotic wagering in this capital growth model. it was 
formulated to consider only win, place, and show betting, though, because that was the 
only available data for testing the model. 
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pi Pj ) 
x |[— m + | 
TaT Pi + p 


[o(s + st) — (Six + sit spt s)| 
4 


+ 
3 
( si si Sk 
Si + si Si t+ si Si + sh 
n n n 
- (Diwe+ > pe+ > si) 
t=1 t= {=l ’ 
ei {Ai LAP K 
subject to 
T n 
>. (we + pe + se) S Wo, 
t=1¢=1 
we = 0, pe 20,520, t=1,...,7,€ =1,...,2. 


This model requires estimates of the win probabilities for each horse. 
The efficiency studies of the win market have indicated that the pub- 
lic’s win odds, adjusted for the favorite/longshot bias, provide good 
estimates of these probabilities. However, with cross-track wagering 
we not only have a different set of win odds for each participating track 
but, as demonstrated by the Ferdinand example in Section I and the 
risk-free hedging model in Section III, these odds can vary consider- 
ably. Rather than take a weighted average of all the tracks’ odds to 
arrive at a set of win probabilities, we decided to use only the home 
track’s odds.'° This decision was based on a perceived informational 
advantage that the home-track public has over the bettors at the cross 
tracks, an advantage that results from several factors: (1) since these 
races were run near the end of the day’s racing, the home-track public 
had watched the jockeys perform in, perhaps, several races already, 
they had observed the condition of the track and possibly noted any 
track biases, and they saw the horses in the paddock and in the parade 
to post; (2) the home crowd knows better if their track tends to favor 
front-runners or late chargers; and (3) since the home track is usually a 
larger track that has many major races, its public is more likely to have 
seen some of these horses race earlier in the season. Further, the 
studies supporting win-market efficiency have all involved home-track 
odds. Thus, we assume win-market efficiency at the home track and, 


10. This decision was not the result of any analysis of the data. There are only a few 
cross-track races each year and the required data for each race are the final tote-board 
figures from several of the racetracks permitting this betting. These difficulties led to data 
on only 10 races being collected, much less than would be required for any analysis. 
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TABLE 4 1982 Preakness 

Horse Finish Pimlico’s Win Odds Win Probability 
1. Reinvested Sii 7.6-1 .090 

2. Cut Away 3d 41.6-1 O11 

3. Water Bank ves 12.0-1 056 

4. Bold Style wee 26.2-1 023 

5. Laser Light ene 5.3-1 125 

6. Linkage 2d 5-1 597 

7. Aloma’s Ruler Ist 6.9-1 098 


after adjusting for the favorite/longshot bias in figure 1, use the home- 
win odds to estimate win probabilities. Since this win-market efficiency 
is not necessarily maintained at the cross tracks, the model can then 
include possible win betting at the cross tracks. The 1982 Preakness 
Stakes will be used to illustrate this model. Table 4 lists the race’s 
entrants and their odds. 

Aloma’s Ruler’s 6.9-1 win odds meant that, when he won, each 
dollar wagered on him was returned with an additional $6.90 profit. 
Linkage was the crowd’s favorite. His 0.5-1 win odds meant that, had 
he won, he would have returned $1.50 per dollar wagered on him. After 
adjusting the win probabilities for the favorite/longshot bias, the proba- 
bilities were then normalized to sum to one. These probabilities and 
equation (1) adjusted for breakage allow the calculation of the expected 
return on an additional dollar bet to win for each horse i at each track ¢. 
Similarly, the Harville formulas in assumption 1 and equations (2) and 
(3), also adjusted for breakage, allow the calculation of the expected 
returns on place-and-show bets (see Hausch, Ziemba, and Rubinstein 
1981). We received final win, place, and show figures from four of the 
tracks that allowed betting on the 1982 Preakness: Golden Gate and 
Los Alamitos in California, Centennial in Colorado, and Penn National 
in Pennsylvania. The calculated expected returns on the various bets at 
the four tracks are presented in table 5. 

If a track has a payback proportion of, say, 0.85 then the average 
return on $1.00 bets, ignoring breakage, will be $0.85. With breakage, it 
will be somewhat less than $0.85. For individual horses the expected 
returns from table 5 vary from 0.181 (show bet on horse 2 at Golden 
Gate) to 1.336 (place bet on horse 6 at Los Alamitos). Of particular 
interest are the expected returns exceeding 1.00 since those indicate 
the wagers that have a positive expected profit. We restricted our 
attention to wagers with expected returns of at least 1.10. They are the 
highlighted expected returns in table 5. If the probability estimates of 
this model were exact, then this 1.10 cutoff could result in suboptimal 
wagering, particularly since diversification can even lead to the inclu- 
sion of wagers with negative expected profits. Hausch, Ziemba, and 
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TABLE 5 Expected Returns, Cross-Track Betting on 1982 Preakness 
Horse 
1 2 3 4 5 6 7 
Finish as 3 Sass ads snd 2 1 
Win probability .090 O11 .056 .023 125 597 .098 


Expected return on 
a $1 bet to win: 


Golden Gate 837.246 437 -708 -788 -955 1.000 
Centennial .900  .389 370 -570 .825 836 1.147* 
Los Alamitos -666 391 .347 TTT. .713 1.015 1.196* 
Penn National 954  .484 588 1.109* -650 896 755 


Expected return on 
a $1 bet to place: 


Golden Gate .749 193.349 582 669 1.149* .880 
Centennial .769 = 238.260 329 719 1.084 1.120* 
Los Alamitos 794.233, 277 556 778 1.336* .888 
Penn National .673 39939] 737 -586 1.101* 731 


Expected return on 
a $1 bet to show: 


Golden Gate 837 181 405 413 817 1.153* .996 
Centennial 747 197.340 252 .803 1.008 1.138* 
Los Alamitos .890  .200 .392 -341 1.180* 1.293* .873 
Penn National .710  .235 451 .388 .769 1.099 .793 


NoTtE.—ln this table, horses 2, 6, and 7 are Cut Away, Linkage, and Aloma’s Ruler, respectively. 
* Highlighted expected returns (returns of at least 1.10). 


Rubinstein (1981) found, however, that due to the approximations in 
the model it was more profitable to wager only if the expected return 
was well in excess of 1.00. For large tracks and races of the quality of 
these Triple Crown races, a minimum expected return of 1.10 is rea- 
sonable. 

The capital growth model was run on the data from these cross 
tracks. Table 6 presents the optimal portfolio of wagers assuming a 
1.10 expected return cutoff and an initial wealth of $2,500. This port- 
folio has wagers totaling $2,437 at all four tracks and its certainty 
equivalent can be calculated as $432. Since the first three finishers of 
this race were horses 7, 6, and 2, our bettor would collect on the win, 
place, and show bets on horse 7 and the place and show bets on horse 
6, for a return of $3,716.90 and a profit of $1,279.90. The payoffs in this 
table account for our bettor’s effect on the odds. For example, the 
actual payoff per $1.00 bet to show on horse 7 at Centennial was $2.70. 
However, our bettor’s $83 show bet at this track would have lowered 
this payoff to $2.50. The latter payoff was used to determine our bet- 
tor’s return. 

Table 5 supposes only $1.00 is wagered. Thus, even though one track 
may have a higher expected return to, say, show on some horse than 
another track has, it can be that our bettor will make show wagers at 
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TABLE 6 Optimal Capital Growth Wagers, Cross-Track Betting on 1982 
Preakness 
Expected $ Payoff $ Total 

Horse Bet Track Return $ Bet per $1 Bet Return $ Profit 
4 Win P.N. 1.109 14 ane eae ~ 14.00 
7 Win L.A, 1.196 40 12.00 480.00 440.00 
6 Place L.A. 1.336 855 1.50 1282.50 427.50 
7 Place Cen. 1.120 46 3.30 151.80 105.80 
5 Show L.A. 1.180 172 ERR si — 172.00 
6 Show G.G. 1.153 571 1.30 742.30 171.30 
6 Show L.A. 1.293 656 1.30 852.80 196.80 
7 Show Cen. 1.138 83 2.50 207.50 124.50 

Totals 2,437 3,716.90 1,279.90 


Note.—P.N. = Penn National; L.A. = Los Alamitos; Cen. = Centennial; G.G. = Golden Gate. 


both tracks because of the bettor’s effect on the odds. This happened at 
Golden Gate and Los Alamitos. Horse 6 had expected returns of 1.153 
and 1.293 at these tracks and the optimal show bets were $571 and $656 
at them, respectively. With these wagers, it must be that the addition to 
expected utility from an additional dollar bet to show will be the same 
at the two tracks.!! 

The capital growth model was studied with cross-track data on the 
other Triple Crown races. In each case it was assumed that the initial 
wealth was $2,500. The results of these races and the 1982 Preakness 
are given in table 7. 

Despite losses on two of the races, including a huge loss on the 1984 
Preakness, there was a total profit of $2,647.80 for a 15% return on 
money wagered. Average profits were $264.78. The standard error of 
the mean is $342, so no statistically significant statements can be made 
about positive profits on the basis of these results. Studying the ex- 
pected returns suggests that many of the profitable overlays occur 
because of regional biases. For instance, Conquistador Cielo, the win- 
ner of the 1982 Belmont Stakes, raced his entire career on the East 
Coast. The West Coast bettors were less familiar with him, while on 
the East Coast he was, for many, a sentimental favorite. In the Bel- 
mont Stakes he tended to be sent off at lower odds at the East Coast 
tracks. When he won he paid from $5.80 to win at Commodore Downs 
in Pennsylvania to $15.40 at Los Alamitos in California. Another ex- 
ample is Tolomeo, the English colt that won the 1983 Arlington Mil- 
lion. His odds were 4-1 in England and 38-1 at Arlington Park in 


11. This marginal utility can be complicated because a component of the expected 
return on an additional dollar bet to show on horse 6 at Los Alamitos is the positive effect 
that it has on the return to the show bet on horse 5 at Los Alamitos if 6 finishes out of the 
money. 


TABLE 7 Results of Optimal Capital Growth Model on Triple Crown Races 
Total Certainty 
No. Horses Wagers Equivalent No. Wagers Return Profit 
Race in Race No. Tracks No. Wagers ($) ($) Won ($) ($) 
Preakness: 
1982 7 4 8 2,437 432 6 3,716.90 1,279.90 
1983 11 8 13 1,949 325 1 1,647.30 — 301.70 
1984 10 3 8 2,282 371 0 0.00 — 2,282.00 
1985 11 6 11 1,817 325 4 2,014.40 197.40 
Belmont: 
1982 I} 4 20 942 305 5 1,880.40 938.40 
1983 12 7 20 2,452 469 6 2,578.80 126.80 
1984 11 2 3 1,371 317 1 2,331.00 960.00 
1985 9 9 15 471 73 1 78.30 — 392.70 
Kentucky Derby: 
1984 12 6 13 2,027 340 5 3,027.20 1,000.20 
1985 12 6 7 1,973 350 3 3,094.50 1,121.50 
Totals 118 17,721 3,307 32 20,368.80 2,647.80 
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Illinois. This observation and table 7 suggest that a high expected 
return is possible even with only a few carefully selected cross tracks. 

The Triple Crown races are the most widely publicized of the North 
American horse races. Thus, one might expect that other less pub- 
licized cross-track races would have even more divergence in the odds 
across the tracks. If so, they should have an even greater potential for 
expected profits. 

A major assumption of this model is that after all other bettors have 
made their wagers our bettor (1) learns the tote-board information at 
each track, (2) runs the capital growth model, and (3) communicates 
the optimal wagers to agents at each track, who then make the wagers. 
Obviously this is extremely difficult, if not impossible, in practice, 
never mind any legal concerns. Just learning the tote-board informa- 
tion at each track is difficult because there are no pay phones inside the 
racetrack grounds. A central decision maker and the agents communi- 
cating with cellular phones is feasible but still requires a significant 
amount of time. Unfortunately, though, odds may change in the last 
few minutes of betting, and profitable bets a few minutes before the end 
of betting may not be profitable based on the final odds. Hausch, 
Ziemba, and Rubinstein (1981) studied the odds changes in the last 2 
minutes of betting and found that expected returns did change some- 
what but profitable place-and-show bets 2 minutes from the end tended 
to remain profitable based on final odds. However, the agents in this 
model, because of its extra complications, would probably have to 
report odds more premature than those 2 minutes before the end of 
betting. The results given in this article, then, may overestimate the 
profits possible in practice. Additionally, our profit figures do not ac- 
count for the costs of implementing this procedure—the agents and 
other costs at each cross track, the long-distance phone calls, and the 
computer time. 


V. Testing the One-Track Capital Growth Model 


Implementing the previous section’s capital growth model is certainly 
difficult. However, when the race is televised from the home track 
there are simpler versions of this scheme possible for one bettor at one 
track. Our bettor, with a portable television at the cross track, can 
view the home odds when they are shown on TV. With these odds 
giving “true” win probabilities, our bettor can search for overlays at 
the cross track. A flat bet to win could be made on horses going off at 
longer odds than at the home track, or a sophisticated bettor could also 
bring a portable computer to the track and run the capital growth model 
described in Section IV for one track, that is, T = 1. This latter scheme 
is tested here with the Triple Crown race data. 

As an example, consider the 1984 Kentucky Derby simulcast at 
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TABLE 8 Optimal Wagers Based on One-Track Capital Growth Model, 1984 
Kentucky Derby 
Wager Expected $ Payoff on $ Realized 
Type Horse Return $ Optimal Bet a $1 Bet Return $ Profit 
Win 2 1.236 15 sae ae ~ 15.00 
Win 10 1.159 ae 5.60 ates 
Place 2 1.153 aan oe Sau aor 
Place 10 1.410 249 3.70 921.30 672.30 
Show 2 1.353 261 wee hs — 261.00 
Show 10 1.275 295 2.50 737.50 442.50 
Totals 820 1,658.80 838.80 


Golden Gate Fields in Albany, California. Using the win odds televised 
from Churchill Downs, Golden Gate had six wagers with expected 
returns exceeding 1.10. With a wealth of $2,500, the capital growth 
model yields the optimal wagers shown in table 8. 

This portfolio has four wagers totaling $820. The win bet on horse 
number 10 and the place bet on horse number 2 are zero even though 
they have expected returns exceeding 1.10. This is because the possi- 
bility of number 2 doing well in the race is better accounted for with the 
higher returning win-and-show bets on him. Also, the possibility of 
number 10 doing well is better accounted for with the higher returning 
place-and-show bets on him. Swale, number 10, won the 1984 Derby, 
followed by Coax Me Chad and At The Threshold for a 10-12-9 finish. 
Therefore, only the place-and-show bets on Swale paid off for a return 
of $1,658.80 and a profit of $838.80. 

Table 9 presents the results of this one-track model on the other 
races. The average wagers on a race varied from $78.33 to $1,173.25 
and the average profits varied from — $868.67 to $824.10. The average 
of these 10 average profits was $69.97 or 9.2% on the money wagered. 
Again, there is such variability in the profits that, without additional 
data, no statistically significant statements can be made about positive 
expected profits. 


VI. Final Discussion 


There is considerable evidence that the win market at the racetrack is 
weakly efficient. A risk-free arbitrage model is presented that demon- 
strates that this is not the case with cross-track wagering. The mispric- 
ing by the public at the cross tracks may be due partly to their more 
limited access to information relative to those attending the home 
track. Also, some of the variance in the odds across the tracks may be 
due to certain horses being more familiar to bettors in certain regions of 
North America. The optimal capital growth model suggests that the 
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TABLE 9 Results of One-Track Capital Growth Model on Triple Crown Races 
Average Average Average 
No. Cross-Track Cross-Track Cross-Track 
of Cross Wager Realized Return Profit 
Race Tracks ($) ($) ($) 
Preakness: 
1982 4 1,173.25 1,724.80 551.55 
1983 8 909.25 353.79 — 555.46 
1984 3 868.67 .00 — 868.67 
1985 6 799.67 789.37 — 10.30 
Belmont: 
1982 4 407.25 639.02 231.78 
1983 7 532.71 488.63 — 44.08 
1984 2 1,158.00 1,982.10 824.10 
1985 9 78.33 7.80 — 70.53 
Kentucky Derby: 
1984 6 520.67 884.92 364.25 
1985 6 1,161.33 1,438.42 277.08 
Average 5.5 760.91 830.88 69.97 


discrepancies across the tracks can allow profits, but further work is 
needed to demonstrate significant profits. A simpler version of the 
optimal capital growth model for one bettor at one cross track also 
demonstrates the possibility of profit. 
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The folklore of investment is replete with stories of arbitrage 
opportunities where profits can be made without risk. Such a 
“lock” exists at the racetrack. A simple model provides a cri- 
terion for existence of a set of bets to create the arbitrage plus 
the size of the various investments. 


Ro return only about 82 per- 
cent of the money wagered to the 


winning bettors. Thus, the average bettor 
loses about 18 percent of his or her wag- 
ers. There are two parts to this loss: (1) 
the track take is the predetermined per- 
centage of the betting pool that is kept by 
the track; and (2) breakage is the process 
of rounding down the payoffs on winning 
bets to common payoff amounts. Handi- 
cappers have proposed many systems for 
beating the track but an 18 percent disad- 
vantage is a formidable hurdle to over- 
come. Hausch, Ziemba, and Rubinstein 
[1981] and Hausch and Ziemba [1985] de- 
vised a method that allows the average 
player to win by utilizing the betting bias 


423 


of the players along with an investment- 
decision mode! to determine when and 
how much to wager. The method works 
well, but it is far from risk free. This is 
also true of alternative approaches that 
are discussed by Asch, Malkiel and 
Quandt [1984, 1986], Thaler and Ziemba 
[1988] and Hausch and Ziemba {forth- 
coming a]. However, there are situations 
at the track where it is possible to con- 
struct a risk-free hedge, or in the racing 
vernacular, a lock. The only publications 
we know of that concern locks are Willis 
[1964], Leong and Lim [1989], and 
Hausch and Ziemba [forthcoming b]. The 
latter two deal with cross-track betting 
where a number of tracks each offer 
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odds, possibly very different, on the 
same race. While it may be possible to 
make risk-free profits with cross-track 
betting, it involves considerable overhead 
at the various locations, and the bettors 
may encounter legal problems with com- 
municating the information. Willis [1964] 
discusses a much cleaner situation with 
all activity concentrated on one race at 
one track. When the public's wagers in 
the win market (one collects if one’s 
horse is first) and the place market (one 
collects if one’s horse is first or second) 
are very different, Willis shows how arbi- 
trage between the two markets may be 
possible. However, it would be extremely 
rare for the odds in the two markets to 
vary sufficiently. In fact, Willis supplies 
no actual examples. Our lock concentrates 
on another market, the show market. 

One collects on a show bet if the horse 
finishes first, second, or third. Let S, rep- 
resent the public’s show bet on horse i 
and let Q be the track’s payback propor- 
tion (about 0.80 to 0.86). Finally, let n be 
the number of horses and let S = 2 S; be 
the show pool. Then if horses i, i “and k 
finish 1-2-3 in any order, the payoff per 
dollar wagered on i is 
14 QS - §,- S - Sk 

35S; 
The ticket holders of the first three horses 
are repaid their original bets plus a share 
of the profits. In practice, breakage 
rounds this payoff down to the nearest 5 
cents or, more commonly, 10 cents. With 
INT[Y] giving the largest integer not ex- 
ceeding Y, the formula for the payoff in- 
ee the effect of breakage is 
INT Í NIQS — S; - S,- S4 
3S; 


(1) 


1 2 


1+4 
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10 or 20 for 10 cents or 5 
cents breakage, respectively. If this payoff 
is less than $1.05 for each dollar wagered, 
then most tracks guarantee a return of 


where N = 


$1.05, that is a five percent profit. 
Situations where the track must honor 
this guarantee of a five percent minimum 
payoff are called minus pools. They result 
in the track collecting less than (1—Q)S, 
and hence, tracks try to avoid them 
whenever possible. However, they are not 


Our lock concentrates on the 
show market. 


uncommon. We focus here on a particular 
type of minus pool, one where a heavy 
favorite has about 95 percent of the show 
pool bet on it, that is where S, = 0.955. 
The crowd figures that this horse, or 
group of horses if it is a betting entry, is 
so good that it almost surely will finish at 
least third. In such a situation, one can 
construct a lock in the show pool. 

Locks were described in articles in 
Sports Illustrated [Gelband 1979] and For- 
tune [Seligman 1979] using the example in 
Table 1, the Alabama Stakes at Saratoga 
on August 11, 1979. 

Davona Dale’s 95.5 percent share of the 
show pool is unusually high and has cre- 
ated a minus pool. If Davona Dale fin- 
ishes in the money, that is, at least third, 
then the show payoffs on the first three 
finishers will be the minimum $1.05 per 
dollar bet. Here is how a lock can be de- 
vised: If Davona Dale is in the money, 
then we will receive five percent on the 
money we wagered on her plus five per- 
cent on the money we wagered on the 
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Win Odds Win Show % of 

Horse to $1 Probability Bet Show Pool 
Davona Dale 0.30 0.661 $435,825 95.5 
It's in the Air 3.10 0.210 7,901 1.7 
Mairzy Doates 13.40 0.050 4,518 1.0 
Poppycock 17.50 0.037 4,417 1.0 
Croquis 15.40 0.042 3,873 0.8 
1.000 $456,534 100.0 


Table 1: The public's win odds and show bets are given for the 1979 Alabama Stakes at 
Saratoga. The conversion of win odds to win probabilities accounts for the public’s biases 
(see Ziemba and Hausch [1986] and Hausch and Ziemba [forthcoming b]). 


other two horses that finish in the money. 
If this adds up to more than the wagers 
we lost on the fourth and fifth place 
horses, then we are ahead. If, as well, the 
amounts we wagered on the four long- 
shots are such that, if Davona Dale fin- 
ishes out of the money, our return covers 
both the bet on her and on the other out- 
of-the-money horse, then we have a profit 
regardless of the outcome of the race. A 
lock is clearly a very conservative betting 
strategy; it is consistent with a utility 
function that has infinite disutility for any 
losses. Later we will discuss the logarith- 
mic utility function for comparison, but 
for now we will describe the conditions 
that are necessary for a lock and recom- 
mend wagers on the horses to develop a 
lock. 

Suppose we wish to receive approxi- 
mately the same profit regardless of the 
finish. Initially, let us assume that (1) ex- 
cept for the favorite, the public wagers 
the same amount on each horse to show, 
and (2) our bets do not affect the odds. 
With k as the fraction of the show pool 
on the favorite, the show bet on the favor- 
ite is kS and the equal show bets on the 
other horses are (1—k)S/(n—1). Let x be 
our wager on the favorite and y be our 


show bet on each of the other horses, for 
a total wager of x+ (n — 1)y. If the favorite 
is in the money then we collect five per- 
cent on x and two of the y bets, but lose 
(n—3)y on the n—3 losers, for a profit of 
.05(x + 2y) —(n— 3)y. If the favorite is out 
of the money, then, since it has been sup- 
posed that our wagers do not affect the 
odds, our profit is 


Os: = 3(1 —k)S 
3 ee y 
3 (1 ~kyS/(n — 1) 
— x — (n-4)y. 


The first term is the profit on the three 
horses that finished in the money, and it 
follows from expression (1). The next two 
terms are the losses on the favorite and 
on the —4 other horses. 

To guarantee a particular return regard- 
less of the finish, these two profit func- 
tions must be equal. This means x and y 
must satisfy the following ratio: 
xly = —2 + Q(n—1)/{1.05(1 —k)]. (3) 
This x/y ratio yields profit of .0S5yQ(m— 1)/ 
[1.05(1 —k)] - (n—3)y, and this must be 
positive for a lock to exist. This holds 
when 
k>1 - Q(n-1)/[21(n—3)). 

Our example does not satisfy the first 


(4) 
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assumption: the bets on the long shots are 
not equal; they range from $3,873 to 
$7,901. The second assumption will hold 
only if our wagers are relatively small. 
Despite this, Q=.85 and n=5 for our ex- 
ample, so condition (4) indicates that a 
lock will exist if k exceeds 0.920. Davona 
Dale’s k=0.955 so we can devise a lock. 
Equation 3 gives x/y = 69.96, so with 
$2,500 to bet we would wager x= $2,364 
on the favorite and y= $34 on each of the 
other four horses. 

If Davona Dale finished in the money, 
then our profit would be $53.60. If the 
public’s show bets on the other four 
horses had been the same, then the profit 
would also have been $53.60 even if 
Davona Dale finished fourth or fifth: a 
guaranteed 2.1 percent return. The fact 
that the show bets on the four other 
horses are different will not affect our 
profit of $53.60 if Davona Dale finished at 
least third. If Davona Dale finished fourth 
or fifth, however, our profit would de- 
pend on which of the four horses were 
the top three finishers. Table 2 lists the 
profit for the four possible permutations 
of these four horses being the top three 
finishers. If the four profits are weighted 
by their likelihoods, the average profit is 
about $53.60. 

In this analysis, we supposed that our 
bets do not effect the payoffs. For this ex- 
ample and a total wager of $2,500, this 


Top Three Finishers (order does not matter) 


It’s in the Air — Mairzy Doates — Poppycock 
It’s in the Air — Mairzy Doates — Croquis 
It’s in the Air — Poppycock — Croquis 
Mairzy Doates — Poppycock — Croquis 
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assumption has been fine. In reality, the 
effect of our wages on the payoffs would 
reduce each of the profits in Table 2 by 
less than $7.00 and have no effect on our 
$53.60 profit if Davona Dale was in the 
money. The effect on the payoffs of a 
much larger total wager is more serious 
though. For example, betting a total of 
$25,000 cannot be expected to allow a 
guaranteed profit of about $536.00. 

Equation 4 shows that the condition for 
a lock is more easily met when n is small 
or Q is large or both. Unlike most states 
which have a guaranteed minimum five 
percent return, Louisiana has a minimum 
return of 10 percent. Equation 4, revised 
for Louisiana, yields the less restrictive 
lock condition, k > 1 — Q(n-—1)/ 

[11(n —3)]. If Q = .85 and n = 5, as they 
do in our example, then k = .846 is suffi- 
cient for a lock. 

Conditions (3) and (4) assumed that the 
public wagers the same amount on each 
of the nonfavorite horses. It is possible to 
treat the case where these amounts are 
different, as they are in this Alabama 
Stakes example. Let x continue to be our 
wager on the favorite, Davona Dale, but 
now let y, Yz} y3, and y, be our wagers on 
It’s in the Air, Mairzy Doates, Poppycock, 
and Croquis, respectively. Let R be our 
guaranteed return. Table 3 shows the re- 
turn on each horse for each possible tri- 
plet of winners. The linear program for 


Profit 


$ 12.60 
148.60 
172.40 
597.40 


Table 2: If Davona Dale finishes out of the money, then profit depends on the identity of the 
top three finishers. The order in which they finish does not matter, though. 
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maximizing the guaranteed return to a 
$2,500 bankroll is 

Maximize R 

subject to 

x+ yı + y + y; + y, = 2500 


O5(x + y, + y) — 4 ~Y ZR 
O5(x + y, + ys) — y- yR 
O5(x + y, + Ys) -yY YVR 
O5(x + y + y) >y — Ww ZR 
O5(X + y + y) —Y¥r yR 
O5(x + y + Ys) — ¥2 — ¥2 ZR 


15.60 y, + 27.30 y, + 28.00 y; — x - y; 
2R 

15.60 y, + 27.40 y, + 31.90 y, — x - y; 
2R 

15.60 y, + 28.00 y; + 32.00 y - x ~ 4: 
=R 

27.60 y, + 28.30 y; + 32.20y, —- x — y 
2R 

X, Yir Yr Yar Ya = O. 

The solution is R* = $52.47 with x* = 

$2366.04, y* = $34.54 for i=1,2,3, and y,* 
= $30.34. So, the wagers on the horses 

are close to those made in the equal- 

public-wagers case. Constraints 2, 3, and 

5 have surpluses of $4.41 while con- 

straints 10 and 11 have surpluses of $23.75 

and $454.66, respectively. The remaining 

constraints are binding. 

The lock condition is seldom met, and 
to get a substantial return one needs a 
large bankroll. However, the times when 
it is met should not be complete sur- 
prises; it happens when there is an ex- 
treme favorite that the crowd figures 
cannot be out of the money. Davona Dale 
allowed another lock in the 1979 Coaching 
Club American Oaks at Belmont. She 
won that race with 97.8 percent of the 
show pool on her! Two years after Davona 
Dale won that race, the entry of Heavenly 
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Cause and De La Rose had 95.2 percent 
of the 1981 Coaching Club American 
Oaks show pool. Spectacular Bid ran 30 
races and finished out of the money only 
once, and that was as a two-year-old. He 
was such a standout that he often went 
off at 1-20 odds, a good sign that a lock 
may exist. One lock on him was the 1980 
Amory Haskell Handicap where he had 
96.0 percent of the show pool. A more re- 
cent example is Easy Goer’s impressive 
win in the 1989 Gotham Stakes at Bel- 
mont Park. Easy Goer had 97.1 percent of 
the show pool of $553,658. We thank 
Peter Arnold for bringing this lock to our 
attention. 

If a horse that has virtually all the 
show money bet on it loses, the payoffs 
can be very high. In fact, show payoffs 


The show payoff on Arbor 
Hoggart is thought to be the 
highest show payoff of any 
sort. 


can exceed win payoffs. This happened 
when the public’s favorite, Kassa Branca, 
finished last in the New Jersey Futurity at 
Freehold Racetrack on October 22, 1988. 
Two bettors had wagered $50,000 and 
$150,000 to show on him. Others brought 
the show bet on Kassa Branca up to 
$213,837. That amounted to 98.8 percent 
of the show pool of $216,492. The payoffs 
per two-dollar wager were 


WIN PLACE SHOW 

Beta Bob 41.20 11.20 312.40 
Nukes Image 7.00 113.00 
340.80 


Arbor Hoggart 
The show payoff on Arbor Hoggart is. 


thought to be the highest show payoff of 
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Probability of Respective 
Top three horses these being the show payoffs 
top three horses per $1 bet 
Davona Dale-It’s in the Air-Mairzy Doates 0.34468 1.05, 1.05, 1.05 
Davona Dale-It’s in the Air-Poppycock 0.25139 1.05, £.05, 1.05 
Davona Dale-It’s in the Air-Croquis 0.28692 1.05, 1.05, 1.05 
Davona Dale-Mairzy Doates-Poppycock 0.03589 1.05, 1.05, 1.05 
Davona Dale-Mairzy Doates-Croquis 0.04109 1.05, 1.05, 1.05 
Davona Dale-Poppycock-Croquis 0.02970 1.05, 1.05, 1.05 
It's in the Air-Mairzy Doates-Poppycock 0.00330 16.60,28.30,29.00 
It's in the Air-Mairzy Doates-Croquis 0.00376 16.60,28.40,32.90 
It’s in the Air-Poppycock-Croquis 0.00274 16.60,29.00,33.00 
Mairzy Doates-Poppycock-Croquis 0.00053 28.60,29.30,33.20 
1.00000 


Table 3: The probability of any three horses being the in-the-money finishers is given, along 
with the resulting show payoffs. These probabilities were computed using the Harville [1973] 
formulas. The probability of an i, j, k finish, where q,is the probability of winning, is 9,4,9,/ 
[1 —4) (1—q4,—q,)]. Stern [1987] discusses alternative probability models (see also the 
discussion in Hausch and Ziemba [forthcoming al). 


any sort. The previous harness racing re- 
cord was $296.00 in 1986 at Dover 
Downs. This extreme example would 
have allowed a lock returning close to 
four percent. (Many thanks to Pete Asch 
for pointing out this race to us). Other ex- 
amples are mentioned in Ziemba and 
Hausch [1987]. 

Since the lock strategy requires a guar- 
anteed profit, it is much more conserva- 
tive than the optimal capital growth 
strategy described in Ziemba and Hausch 
[1987]. The optimal capital growth strat- 
egy asymptotically maximizes the rate of 
growth of one’s bankroll. This is achieved 
by maximizing, in a myopic race-by-race 
fashion, the expected value of log utility. 
A comparison of these two strategies can 
be made with the Alabama Stakes race. 
Table 3 shows possible payoffs on each 
horse and their likelihoods, and Table 4 
gives the expected return to a show bet 
on each horse. 

Only the show bets on Davona Dale 


and It’s in the Air have positive expected 
profits. With an initial wealth of $2,500, 
the optimal capital growth bets are 


Davona Dale $2,294 
It's in the Air 203 
Mairzy Doates 0 
Poppycock 0 
Croquis 3. 


The entire $2,500 is wagered, and regard- 
less of the order of finish our bettor does 
not go bankrupt. The certainty equiva- 
lent, that is, the certain return giving the 
same utility as the expected utility of the 
gamble using the logarithmic utility func- 
tion, is $101. The wager on Croquis has 
an expected return of only $0.607 on the 
dollar. While it is unlikely that both 


Horse Expected Return 
on a Show Bet 

Davona Dale 1.039 

It's in the Air 1.090 

Mairzy Doates 0.658 

Poppycock 0.524 

Croquis 0.607 


Table 4: The expected return to show is 
calculated for each horse. 
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Top three horses Profit 
Davona Dale-It’s in the Air-Mairzy Doates $121.85 
Davona Dale-It’s in the Air-Poppycock 121.85 
Davona Dale-It’s in the Air-Croquis 125.00 
Davona Dale-Mairzy Doates-Poppycock — 91.30 
Davona Dale-Mairzy Doates-Croquis — 88.15 
Davona Dale-Poppycock-Croquis — 88.15 
It’s in the Air-Mairzy Doates-Poppycock 808.20 
It’s in the Air-Mairzy Doates-Croquis 908.20 
It's in the Air-Poppycock-Croquis 908.20 
Mairzy Doates-Poppycock-Croquis — 2,399.80 


Table 5: Using the optimal capital growth wagers, profits are given for the possible trios of 
horses in the money. The profit is not affected by the order of the three, though. 


Davona Dale and It’s in the Air will finish 
out of the money, Table 3 shows that, in 
that case, Croquis has a better payoff 
than Mairzy Doates and Poppycock. Table 
5 presents the possible profits from these 
wagers (accounting for their effect on the 
odds). 

Tables 3 and 5 show that the chance of 
a loss is 10.7 percent and the expected 
profit can be calculated to be $106.28, or 
4.25 percent on the bankroll. Clearly, this 
expected return is considerably higher 
than the lock’s certain 2.1 percent return. 
The actual finish, Poppycock-Davona 
Dale-It’s in the Air, yielded a profit of 
$121.85. 

Such a lock exists maybe five to ten 
times a year in North America. While you 
are enjoying the performance of a super 
horse, you may make some profit as well. 
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Abstract 


We present arbitrage and risk arbitrage betting strategies for team jai alai. Most of the 
results generalize to other sports betting situations and some financial market applica- 
tions. The arbitrage conditions are utility free. The risk arbitrage wagers use the Kelly 
expected log criterion. 


Keywords: arbitrage, risk arbitrage, hedging, sequential investing 


1. INTRODUCTION 


This chapter discusses arbitrage and risk arbitrage strategies for betting on team Jai 
alai.! The game originated in the Basque region of Spain and is played in Mexico City, 
Connecticut, Florida, Nevada, Rhode Island, and other locales (see Hollander and 
Schultz, 1978). It is played in a large enclosed rectangular court called a fronton 
between two opposing teams, each having two players. Players serve each point in turn 
and single points are scored by one team winning a rally off the serve, as in squash, 
racquetball, or tennis. Opposing teams alternate hurling and catching a ball (the pelota 
made of goatskin and hard rubber which must be recovered every 15 min of play) with 
an enlarged basket (cesta) against a wall (granite or concrete). When one team misses, 
the other team scores a point. The game is fast and exciting. Games are usually played 
to 30 points. At the fronton, bets may be placed on either team to win the game before 
every point is played at fixed locked-in odds, until the outcome of the game. Payoffs on 
bets made during the game are settled at the end of the game based on the quoted house 
odds at each betting point. We construct arbitrage and risk arbitrage bets with zero or 
little risk while at the same time yielding a positive return. 

Arbitrage occurs in strategies when the net gain of all bets is always non-negative 
and sometimes positive and involves no risk of losing. Conditions that lead to arbitrage 
in various circumstances are studied in Kallio and Ziemba (2007). Risk arbitrages may 
yield losses, but occur more frequently and have higher mean returns. We develop these 
arbitrages for team jai alai. Section 2 provides conditions for arbitrage. Risk arbitrage 
is discussed in Section 3. Final remarks and applications to other areas are discussed in 
Section 4. 

Assume that 


1. The jai alai fronton bet payout rate is the constant Q € (1,0). 

2. The two teams’ relative ability is known and defined by the probability of win- 
ning a single point—team A wins with the score invariant probability p, and B 
with q = 1 — p. 


The probability of A reaching K points before B, given that A currently has 
O<m<K points and B has 0<n< K points, is (according to Montmort, see 


! Goodfriend and Friedman (1975, 1977) and Skiena (1988) have analyzed the game of individual jai alai. 
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Epstein, 1977, p. 109): 


P, = P,(m, n) 
= px | +k- m+ p EE mt) me SAED 
Ken-t (2K-m-n-2)! 
(K-m-1IXK-n-1)! 


K-n-] 
K-m+i-1 
— »,K-m i 


For K = 30, P, is the probability that team A will win the game given the current score 
is m to n. A schedule of P, and P, for all values of m and z over the 30 point game in 
abbreviated form appears as Table 1 for the case of p = 0.5. 

For given fixed Q, a schedule, as shown in Table 2, can be computed from the 
consistent house odds over the game since 


tee tg 


P, = Q/(O, + 1) and P, = Q/(O; + 1) or 


Q Q Q-P, 
——— oe es : 1 
(0) P, 1 and O, P, P, (1) 


In Equation (1) O, and O; are the consistent house odds for teams A and B respectively, 
when the score is (m, n). Odds of 1.5 to 1 means 1.5 profit plus the original bet or 2.5, 
is returned for each 1 bet, and so on. Consistent odds are those that return 1 — Q for 
the house’s profit regardless of which team, A or B, wins. Then the expected return per 
dollar bet on each team is Q. Since P, + P, = 1 independent of Q, the odds then reflect 
the actual value of Q through Equation (1). Hence 


_ Q- PaXQ — Pr) =,;-2@0-9) 


O ph A 
eO; P,P, P,P, 


(2) 


For given Q and odds on A of O, to 1, consistent odds on B may fail to exist that 
guarantee the house advantage 1 — Q. In that case, if there is a minimum payout 
of 1, that is, you just get your money back, then there is a minus pool and the house’s 
actual take is less than 1 — Q. A minus pool is defined to be this situation where the 
house take is 1 — Q* < 1 — Q, where Q* > Q. So the effective Q, namely Q*, is higher 
and the odds on B always exist since at their lowest the odds on O, are the reciprocal 
of the odds on A. That is the case where the payback is all the money wagered and 
the house makes no profit since Q* = 1. In general, these odds are possibly as low as 
the minimum guarantee, that is, O, = 1/O,. In a typical minus pool, the odds on B 
are higher than 1/O,. For example, if the minimum payout is 1.05, as is typical, then 
consistent odds on O, may fail to exist. In this case, the odds on A are too large given 
the 1 — Q* demanded by the minimum guarantee so that the odds on B do not exist. 
Let O, be Q = 1 consistent odds (i.e., odds that give Q = 1) and Oy, are related by 


TABLE 1 Probability that Team A Wins When the Score Is A = m and B = n and the Single Point Probability is p = .5 

a/n o 6% 2 3 4 5 6 7 8 9 30 44 12 $3 14 9S 16 97 98 19 20 21 22 23 24 25 26 7 20 29 

° +80 .55 .60 .65 .70 .75 .79 .83 .B7 .90 .92 .94 .9% .97 698 199 199 

1 245 2.50 .SS .61 .66 .71 .76 .80 .84 .67 .90 .93 .95 .96 .96 .98 .99 .99 

2 +40 .45 .50 .55 .61 .66 .71 176 .80 .86 .88 .91 .93 .95 .97 .98 .99 .99 

3 +34 139 .45 .50 «56 161 666 .72 276 .8t .85 .88 .91 194 .96 197 .98 .99 99 

4 -JO «34 139 144 050 156 .6t 067 272 1.77 .81 185 189 .92 .94 .96 197 1.98 .99 .99 

5 «25 629 .J4 .39 .&4 .50 .56 .6} .67 .72 .77 .82 .86 .89 .92 .95 .96 .98 .99 .99 

6 -21 124.29 134 039 044 150 .56 .62 .67 .73 .78 .83 .87 .90 .93 .95 .97 .98 199 .99 

1 -17 .20 .24 .28 .33 .39 .44 .5O .56 .62 .68 .?3 .79 .83 .87 .93 .93 .96 .97 .90 .99 .99 

8 243.16 20 124 .28 .33 238 144 150 .56 .62 .68 .74 .79 .84 .88 191 194 196 097 .99 199 

9 230 213 216 239 423 128 239 038 .44 150 .S6 .63 .69 274 60 .64 .89 192 .94 .96 .98 199 99 

10 -0G 610 212 .15 219 123 227 632 238 144 250 256 163 .69 .75 1.60 .85 .89 193 19S .97 98 299 

n -06 07 .09 .12 .15 .18 .22 .27 .32 .37 .44 .50 .$7 .63 .70 .76 .Bi .B6 .90 .93 .96 .97 .99 .99 

2 204 0S 607 09 210 214 217 421 26 031 .37 143 .SO .57 .64 .70 .76 .B2 287 091 .94 .9%6 196 .99 .99 

3 203 D4 .05 .06 06 .1) .13 .17 .21 .26 .3t .37 .4) .50 .57 .64 271 .77 .83 .88 .92 .95 .97 98 .99 

14 202 .02 203 .04 .06 .08 410 .13 .16 .20 «25 .30 36 -43 050 .S7 264 271 278 64 89 192 .95 197 199 199 

15 -Ot .02 .02 .03 .04 .05 .07? .09 .12 .16 .20 .24 .30 .36 .43 .SO .57 .65 .72 .?9 .85 .89 .93 .%6 .98 .99 

16 +01 Ot 01 02 .03 O4 605 .07 .09 .11 615 .19 24 .29 .36 .43 050 S56 16S .73 .BO .86 .9) .94 ,9? 98 199 

7 .0) .01 .01 .02 .02 .03 .04 .06 .06 .31 .14 218 .23 129 035 142 50 .SH .66 .74 .81 .87 .92 .95 .98 199 

8 +O) 01 Ot 02 .03 .04 .06 .07 .10 .t3 .?? ,22 .28 .35 .42 .5Q .S8 .67 .75 .82 .88 .93 .96 .98 .99 

19 201 .01 01 .02 .03 .04 .05 .07 .09 .12 96 .21 427 .J4 .42 .SO .59 .68 .76 .83 .89 .94 197 .99 

20 »O3 .OF Oi .02 .03 .04 .06 .08 «th 215 220 226 233 243 .$50 .59 69 1.77 85 .91 19S 198 199 

ai -01 .01 .01 .02 .03 .04 .05 .08 39 114 199 625 132 241 050 1.60 .70 .79 1.87 293 .97 99 

22 -O1 Ot .01 .02 .03 605 07 .O9 213 .18 124 31 .40 150 .60 .71 161 689 .95 .98 

23 -01 .0t .02 .03 .04 .06 .06 «12 .17 1.23 130 .40 .50 61 .73 163 191 296 99 
24 +01 .01 O1 02 03 OS .07 .1f .15 228 .29 .)9 250 .62 .75 .06 .9%4 .98 
25 ‘Ot OI .02 .02 .04 .06 09 .1) .19 .2? .38 .SO .64 .7?? 89 .97 
z6 +01 .O1 .02 .0} .0S .07 «51 17 1.25 .36 .50 .66 .Bt .94 
2 =O) .01 202 .03 .05 .09 1.14 123 .34 250 169 .88 
28 201 O3 602 1.04 .06 613 619 631 150 17S 
29 -Ot .02 .03 .06 .13 .25 .50 


NOTE: This table is symmetric in the sense that Prob(A wins) with score m, n equals Prob(B wins) with score n, m. This occurs if and only if p = .5. 


Source: Lane and Ziemba (2004). 
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TABLE 2 Consistent Odds for Teams A and B for Given Q 


Payback rate, Q 


257 


House odds, O,/1 1.00 0.950 0.900 0.850 0.800 0.750 0.700 
0.100 10.000 9.025 8.100 7.225 6.400 5.625 4.900 
0.111 9.000 8.123 7.290 6.503 5.760 5.063 4.410 
0.125 8.000 7.220 6.480 5.780 5.120 4.500 3.920 
0.143 7.000 6.318 5.670 5.057 4.430 3.938 3.430 
0.167 6.000 5.415 4.860 4.335 3.840 3.375 2.940 
0.200 5.000 4.512 4.050 3.613 3.200 2.813 2.450 
0.250 4.000 3.610 3.240 2.890 2.560 2.250 1.960 
0.333 3.000 2.707 2.430 2.167 1.920 1.688 1.470 
0.500 2.000 1.805 1.620 1.445 1.280 1.125 0.980 
1.000 1.000 0.903 0.810 0.723 0.640 0.563 0.490 
1.500 0.667 0.602 0.540 0.482 0.427 0.375 0.327 
2.000 0.500 0.451 0.405 0.361 0.320 0.281 0.245 
2.500 0.400 0.361 0.324 0.289 0.256 0.225 0.196 
3.000 0.333 0.301 0.270 0.241 0.213 0.188 0.163 
3.500 0.286 0.258 0.231 0.206 0.183 0.161 0.140 
4.000 0.250 0.226 0.202 0.181 0.160 0.141 0.123 
4.500 0.222 0.201 0.180 0.161 0.142 0.125 0.109 
5.000 0.200 0.180 0.162 0.144 0.128 0.113 0.098 
5.500 0.182 0.164 0.147 0.131 0.116 0.102 0.089 
6.000 0.167 0.150 0.135 0.120 0.107 0.094 0.082 
6.500 0.154 0.139 0.125 0.111 0.098 0.087 0.075 
7.000 0.143 0.129 0.116 0.103 0.091 0.080 0.070 
7.500 0.133 0.120 0.108 0.096 0.085 0.075 0.065 
8.000 0.125 0.113 0.101 0.090 0.080 0.070 0.061 
8.500 0.118 0.106 0.095 0.085 0.075 0.066 0.058 
9.000 0.111 0.100 0.090 0.080 0.071 0.063 0.054 
9.500 0.105 0.095 0.085 0.076 0.067 0.059 0.052 

10.000 0.100 0.090 0.081 0.072 0.064 0.056 0.049 


Source: Lane and Ziemba (2004). 


Os, = Q*O, — (1 — Q*). Actual house odds, Oa, may differ from the consistent odds 
O, and O, for a number of reasons, including the desire of the oddsmakers to balance 
their books, competition among individual bookies for larger shares of the total pool, or 


additional information about teams’ performance. This then adjusts the actual 1 — Q* 
that the house receives. 
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For team A, or respectively B, the house odds are: Oaa = Oa (consistent), Og, > Ou 
(favorable), and Oaa < Oa (unfavorable). 


2. THE ARBITRAGE 


Arbitrage is the proverbial sure bet. For each betting point of a K point game with a 
utility function U and betting wealth W, optimal arbitrage bets can be found by solving 


max E [U (Ba, Bs)] (3) 
B,>0,B,>0 


s.t. B+B, SW 
BaOan + Ba > Ba + Bp 
B, Oy, + B, > Ba + By 


where B, and B, are the amounts bet on A and B, respectively, and E is the expectation 
operator. Besides the budget constraint, the arbitrage constraints indicate that the return 
if either A or B wins is never less than the total bet on A and B. This reduces to the 
arbitrage betting condition at every point of the game. These constraints yield 


1/Oan < Ba/Br < On, By #0, (4) 
which demonstrates: 
Theorem 1. The arbitrage exists if 
OchOvn 2 1. (5) 


The arbitrage condition in Equation (5) is utility free and holds for all U. Both 
B, and B, must be positive or both zero. 


The constraints in Equation (3) imply that 
B,/ By, < W/B, - 1. 
By Equation (4), 
1/Oan, < W/B, —- 1. 
Hence 
By < OanW/(1 + Oan). 
Similarly, 


Ba 2 w/a + Oan). 
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We consider betting strategies using the strategy variable f > 1, using 
Ba = fW/(1 + Oan) and By = W(1 — f + Oar)/C + Oan), 
when 
OchOnm = 1, Ba = W/(1 + Oan), Bo = OchW/(1 + Oan) and f = 1. 


To guarantee the arbitrage, the profit must be non-negative regardless of which team 


wins: 
profit if A wins = W (Oan f — (1 — f) — Oan)/(1 + Oan) (6) 
=W(f-1)20, 
profit if B wins = W (O (1 — f) + Ox,0an — P) + Oar) (7) 


= W (Onl + Oan) — FC + Oon))/CL + Och) > 0. 
To satisfy Equations (5—7) 
Smin = 1S f < Opn(1 + Ogh)/C1 + Obr) = fmax: (8) 
Figure 1 illustrates how the net payoffs vary with f. For 
S” = (Oan + 1)(Oon + 1)/(1 + Och + Ooh +2), (9) 
one maximizes the minimum arbitrage profit 
W (OarOen — 1/Och + Orn + 2). 


More insight about the arbitrage betting condition in Equation (5) may be obtained by 
comparing it to consistent odds at each betting point. They require that 


0,05 = Q’. (10) 
The house odds favorability factors rą > 0 and r, > 0 for favorable odds are defined by 
Oan = Oall +2) and Op, = On(1 + re). (11) 

Then Equation (5) implies that 
QA Hra +r) > 1. (12) 


Note that r4 > O and rg > 0, although typical, is not required for Equation (12) see also 
Figure 2. 
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Profit 


WERT (Ay EA ae Se RR SSH Sema SRS a Ais 


max H (B) 


Maximin | Sea SSS 


hedge 
profit 


f, Strategy variable 


fnin = 1 fmax 


FIGURE 1 Profits for Teams A and B for initial betting wealth of | unit vs. strategy variable f, when 
Oan > Opn and OghOpn > !. Source: Lane and Ziemba (2004). 


Q1 > Q2> Q3 


0 rb 


FIGURE 2 House odds favorability regions. Source: Lane and Ziemba (2004). 


If ra = rẹ = r, the schedule of Q versus r and the region of betting under the perfect 
hedge is as shown in Figure 3. 

For typical values for Q of about 0.85 the required favorability for house odds quoted 
on both teams is nearly 20%. Such discrepancies occur occasionally in actual betting. 
More frequently, however, one needs to take added risk to get good bets so we now turn 
to the construction of risk arbitrages. 
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r 


House odds 
favorability, 
lg=lp=r 


Q, Payback rate 
0 0.5 1.0 


FIGURE 3 Betting region of Q vs. r. Source: Lane and Ziemba (2004). 


3. RISK ARBITRAGES 


Two approaches are considered for constructing risk arbitrage positions. They exploit 
the observed house odds favorability conditions to find good arbitrages strategies. 
A maximal capital growth model encompassing these approaches is 


unos (Un W(a)] (13) 


st È (Bali) + Beli) < Wo 
i€g 
È, BaliOzali) > a $, Boli) 
ieg ieg 
È BoliOn(i) 2 a $, Bali), Yg EG 


ieg ieg 


where E, represents mathematical expectation with respect to the game path g, G is the 
set of scenario game paths from (0, 0) to the final outcome, W (g) is the wealth with 
g, and the constant a > 0 is the relative degree of risk of the bettor in the arbitrage. 
For example, if « = 3/4, the bettor requires that total returns must cover at least 75% 
of total bets in any game. With a = 1, the goal is to find an arbitrage to cover all bets 
and a premium is required for betting in any game if a > 1. The log utility function 
corresponds to the Kelly (1956) system of betting which maximizes the asymptotic 
long run rate of growth of the bettor’s fortune. See MacLean and Ziemba (2006), Thorp 
(2006), and Ziemba and Ziemba (2007), for a summary of results concerning such 
betting strategies. 

The first approach to the risk arbitrage problem is a model that analyzes the objec- 
tive function over all feasible paths of the game and computes the bets B,, B, which 
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TABLE 3 Cardinality of G is Large 


Game 
Points in Total betting points Number of 
the game betting points Min Max game paths 


1 1 1 1 2 
2 4 2 3 6 
3 9 3 5 20 
4 16 4 7 70 
5 25 5 9 252 
30 900 30 59 5.91 x 10!6 


maximize Equation (13). This presupposes that information is known in advance 
about the odds Oan and Obr, actually set by the house throughout the game. This 
information may be a probability distribution or a function for Oan and Og,, over the 
scores of the game. The drawback is that the cardinality of G is very large as shown in 
Table 3. Calculations for two and three point games using a = 1, Q = 0.85, and p = 0.5 
appear in Table 4. These calculations utilize the following probability distributions for 
the house odds favorability factors rg = r, = r: 


1. r ~ N(0,o7) is iid for all scores of the game. 
2. r~ N[r(i),o7], depends upon the size of the lead and the number of points the 
leader is away from winning the game. 


This relation is given by 


ri) = exp(—M/D)-— 1 if teami is leading by M points 
exp(pM/D)-—1 __ if team i is trailing by M points, 


D is the number of points the leader is away from winning the game, namely K — S, 
where K are the points needed to win the game and S is the score of the leader. 

The results yield insights that may be useful in the construction of good heuristic 
strategies for 30 point games. For the normally distributed favorability factors, there is 
an intrinsic threshold value for the odds favorability below which no initial bet is placed. 
This threshold is about a 20% odds favorability. Secondly, betting almost always takes 
place in natural-arbitrage pairs, where a bet on one team at one point of a particular 
game path is paired with a compensating bet on the other team later in the game along 
the same game path. Where perfect arbitrages could be constructed, these dominated 
all other possible bet points. As variance increases, the number of lucrative bets also 
increases both in number and in size of bet. 
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TABLE 4 Sample Results for Two and Three Point Games 


House Games bet per Average percent of bet Average value of 
odds type W œ games played compared to game bet funds per game played 
Two point games 
(1) 1 0 O/1 0 0 
1 l 1/10 100 0.0005 
l= 32 5/10 84 0.0073 
(2) 1 0 0/1 0 0 
1 cl 0/10 0 0 
I-32 2/10 100 0.0119 
Three point games 
(1) 1 0 oft 0 0 
Lo al 0/10 0 0 
1 2 7/10 80 0.0193 
(2) t 0 0/1 0 0 
Me ad 3/10 75 0.0098 
‘ey 4 5/10 60 0.0059 


Source: Lane and Ziemba (2004). 


Many betting points occurred in pairs of tied scores and trailing team bets, where 
the higher odds for the trailing team boost the combined bet pair over the arbitrage 
condition requirements. Under these situations, gains could be realized on a team that 
came from behind to win, and strategies can concentrate on this possible event occurring 
while hedging a priori that the leading team wins. 

In the exponential function distribution, the high relative favorability of the trailing 
team contributes to still greater emphasis on betting on the trailer to come from behind 
to win while hedging (usually early in the game, or at a tied point) on the leader to win. 
As in the normal case, the initial bet favorability threshold value of 20% continues to 
manifest itself in the results. 

Additional calculations showed modest expected total gains in the range of 1-2% 
using this method for sets of 25 games. The gains in the three point game are larger 
than the two point game for the same level of variance, which suggests that higher gains 
could be anticipated as the game size increases and more betting opportunities arise. 

We now utilize these insights in a second approach to risk arbitrage by constructing 
single arbitrage bets for the 30 point game. A single arbitrage is a bet on one team and 
subsequently a bet on the opposing team later on in the game such that the constraints 
of Equation (13) hold. Unlike arbitrage, which required betting both teams at the same 
time, this risk arbitrage does not require concurrent bets. The idea is to exploit the 
favorability of quoted odds on one team at some point in the game and take the risk 
that the house odds will become attractive enough on the opposing team later on so 
that an arbitrage may be constructed. However, the second half of this bet will not 
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always materialize. Volatility and prediction models are useful here. This hedge may be 
formulated by examining the constraints on the bets B and 


Ba(Sa)Oan(Sa) 2 aBs(Sa), a > 0 


(14) 
ByCS5)Oon(S4) > Ba Sp), 
where S, and S, are the scores of the game at the time a bet is made on team A and B, 
respectively. 

The approach taken in the construction of the single hedge is to simulate the passage 
of the 30 point game using the same assumptions about odds favorability as with the 
two and three point game model. 

The simulation model of the 30 point game generates at each point an odds favora- 
bility for each team, and the winner of the point using a uniform distribution on p. Bets 
were generated by first initializing a threshold favorability value which is used as the 
basis for placing the initial bet. Once the initial bet is placed whose amount is deter- 
mined by criteria discussed below, the game continues until a new betting point for the 
opposing team is found such that the arbitrage condition in Equation (5) is satisfied. At 
the end of the game the bets are settled and the results recorded. Each simulation run is 
a set of 30 point games. 

To find better strategies, and study the sensitivity of the model, different sets of games 
were simulated and compared in order to find the threshold favorability values which 
gave consistent positive expected net gains over the entire set of games. The results 
are highly sensitive to the value of this parameter. Low threshold values typically mean 
early betting points and more likely completion of the single hedge. However, early bets 
typically mean low odds and thus a small hedge margin (the amount by which OarObn 
actually exceeds 1) and consequently smaller net gains. Higher values mean delayed 
betting points with a greater chance of not completing the second half of the hedge. 
However, potential loss due to unhedged or single bet games is compensated for by 
higher hedge margins (due to later game scores, larger point spreads and higher odds) 
and hence more potential net gains when the risk arbitrage is successfully completed. 
Typical results for these single arbitrage betting pairs appear in Table 5. In two games, 
the risk arbitrage was completed. One leads to a positive gain, the other to a break even 
situation. In the other game, the risk arbitrage was not completed and leads to a loss. 
If a risk arbitrage is not completed, it invariably leads to a loss because the team that you 
would like to bet on to complete the quasi-hedge remains ahead throughout the game. 

Figure 4 gives a profile of three different sets of games under various initial bet 
threshold favorability values for the normally distributed case. 

In the cases illustrated and described above, the implicit assumption in the con- 
struction of the hedge and the determination of the amounts bet was that the arbitrage 
condition in Equation (5) is satisfied as an equality, that is, OarOsa = 1. Thus the arbi- 
trage margin was assumed to be zero in the construction of the arbitrage bet pair. 
However, if a margin, m > 0 exists, such that 


OgnOp, = 1 +m 
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TABLES Sample Results for Single Risk Arbitrage Betting Pairs 


Game Amount Score Actual House 
number Bet Team bet A B odds odds Favorability 
10 1. B 0.8581 18 23 0.1288 0.1654 0.2838 
2 A 0.1419 6.2963 7.6719 0.2185 
Final 
score Bets Payoff 


Team A 31 0.1419 1.0886 
Team B 29 0.8581 —0.8531 
Net betting payoff = 0.2304 


Game Amount Score Actual House 
number Bet Team bet A B odds odds Favorability 
11* l. A 0.0306 14 23 25.7480 31.6751 0.2302 
Final 
score Bets Payoff 
Team A 15 0.0306 —0.0306 
Team B 31 0.0 0.0 


Net betting payoff = —0.0306 


Game Amount Score Actual House 
number Bet Team bet A B odds odds Favorability 
21 k B 0.6493 0 3 0.4466 0.5401 0.2094 
2. A 0.3507 l 5 2.0653 1.9782 —0.0422 
Final 
score Bets Payoff 


Team A 17 0.3507  -0.3507 
Team B 30 0.6493 0.3507 
Net betting payoff = 0.0000 


* Arbitrage condition not realized, only one bet placed. Source: Lane and Ziemba (2004). 


then the effect is to permit bets to place more emphasis on one team or the other or 
neither within the limits of satisfying the constraint set of Equation (13) (see Figure 1 
for the arbitrage case). The perceived existence of a risk arbitrage margin allows the 
bettor to express the bet as a function of the score at the time of the bet placement, or as 
merely a prediction about the eventual outcome of the game. The actual margin cannot 
be known until the arbitrage condition is satisfied at the second betting point. However, 
because the value of the margin affects the initial bet, it must be anticipated by some 
estimate. 

If a margin exists, then the arbitrage, if it has been successfully constructed, satisfies 
all the conditions required for a member of the infinite set of betting pairs defined by 
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Total f 
net a 
gain coe 


0 Threshold 


20 


s po favorability 
Positive total 
net gain region 
common to all 
games sets 
Set C 
FIGURE 4 Thirty point game, single risk arbitrage simulation ro search. Source: Lane and Ziemba 
(2004). 
TABLE 6 Total Gain Values for a 100 Game Set 
fi min f* fi max 
m/ro 0 0.05 0.10 0.20 0 0.05 0.10 0.20 0 0.05 0.10 0.20 
0 1.41 0.47 -0.22 2.22 I4 0.47 —0.22 2.22 14l 0.47 -0.22 2.22 


0.05 2.22 1.09 -0.02 3.35 2.01 0.93 -0.05 3.50 1.92 0.86 -0.10 3.89 
0.10 2.67 2.43 -0.02 3.18 2.32 2.25 -0.08 3.62 2.07 1.92 —0.15 4.27 
0.20 1.19 0.08 -147 3.09 0.78 —0.03 -1.50 409 0.17 -0.68 -1.57 5.17 


NOTE: Exponential function favorability, o = 0.2, constant strategy. Source: Lane and Ziemba (2004). 


the single betting strategy variable, f of Figure 1. However, if the margin should not 
materialize, then the risk is that the risk arbitrage will not be completed and the game 
may end with an unpaired bet that may be lost. If on the other hand the anticipated mar- 
gin understates the actual margin, then there is an opportunity loss due to the wrongly 
specified betting split. Hence even though the bet is not lost, we could have done better 
had we known the exact margin value. 

This model was used in this second way to search for best values for the anticipated 
margin factor, m and the threshold favorability factor, ro values for the different sets of 
games described above. In the first instance, a constant betting strategy was employed 
independent of the score at the time of the initial bet. The strategy variable used here 
corresponds to the upper and lower limits of f, as well as f*. 

Typical results for various m values are in Table 6. 
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When the anticipated margin is zero, the results of the three strategies are identical 
since the zero margin assumption uniquely defines the bets for each team (given by 
f=. 

For some (m, ro) pairs, the fmin Strategy yields better results while for other pairs fmax 
is superior. The f* results always take intermediate positions between fmin and fmax for 
all (m,r). The results for all three strategies however are not significantly different for 
any (m, ro). This performance is not unexpected and for larger game sets it is expected 
that there would be less difference between the results of these three strategies. 

As m increases for constant ro, the total gain rises and then falls off as the second half 
of the bet pair is more difficult to complete. Low ro returns are, in general, positive with 
low variance since bets are placed early and the pairs are completed early, generally with 
low odds. High ro values yield better mean returns with larger variances. Intermediate 
ro values reduce the returns. The results of this strategy for the same set of games used 
in Table 6 appear in Table 7. 

Comparing the automatic versus constant variable strategies shows that the automatic 
strategy dampens the extreme results of the fmin and fmax strategies of Table 6 while 
improving on the more conservative results of f*. While these results may not be consid- 
ered as being significantly different, the trend of the automatic strategy is toward a more 
stable and profitable outcome. More importantly, the variance of the expected gains is 
reduced by about a third from six to four over the comparable results in Table 6. Finally, 
the automatic strategy, being score dependent, is more intuitive and appealing and is the 
preferred policy for the single hedge construction problem in the 30 point game. 

While single risk arbitrage jai alai results are encouraging, the gains that occur for 
the simulated game sets are modest. There is greater potential for larger gains in longer 
games as there is more opportunity for inefficiency to manifest itself through volatility. 

Combining the results of the two risk arbitrages, our final analysis examines the 
policy of betting over the 30 point game through the construction of a series of arbitrage 
bet pairs. 

The simulation model discussed above was modified to accommodate a series of 
single arbitrage bets. The same amount was assumed available for each arbitrage bet 
pair with a maximum total bet availability requirement of 60 betting units. 


TABLE 7 Single Hedge Construction; Total Gain Over 100 
Games Sets Using an Automatic Strategy Variable 


Threshold favorability. ro 


m 0 0.05 0.10 0.20 
0 1.41 0.47 —0.22 2.22 
0.05 2.11 1.03 —0.02 3.55 
0.10 2.49 2.33 0.02 3.61 
0.20 0.94 0.02 —1.24 3.12 


Source: Lane and Ziemba (2004). 
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TABLE 8 Betting Summary: Multiple Quasi-Hedge Bets 


Amount Score Actual House 
Bet Team bet A B odds odds Favorability 
1 A 0.5459 ! 0 0.6894 0.8733 0.2668 
1 B 0.4541 l 0 1.048 1.2047 0.1495 
2 A 0.523 1 1 0.85 0.9356 0.1007 
2 B 0.477 4 2 1.3188 1.4157 0.0735 
3 A 0.535 2 1 0.6869 0.9127 0.3288 
3 B 0.465 2 t 1.0519 1.16 0.1028 
4 A 0.6342 4 2 0.5479 0.6057 0.1055 
4 B 0.3658 9 6 1759 1.7204 ~0.0219 
5 B 0.3139 10 7 1.7899 2.1853 0.2209 
5 A 0.6861 10 9 0.6606 0.7306 0.1059 
6 B 0.3855 11 9 1.4189 1.5938 0.1233 
6 A 0.6145 12 11 0.6517 0.6392 —0.0192 
7 B 0.3167 12 10 1.4385 2.1578 0.5 
7 A 0.6833 12 10 0.5022 0.5551 0.1052 
8 A 0.4782 12 12 0.85 1.1168 0.3139 
8 B 0.5218 25 24 1.4049 1.0277 —0.2685 
9 A 0.2919 19 21 1.7751 2.4262 0.3668 
9 B 0.7081 19 21 0.407 0.4617 0.1344 
10 A 0.2301 19 22 2.6866 3.3467 0.2457 
10 B 0.7699 21 23 0.3706 0.338 —0.088 
11 A 0.4152 22 23 1.3005 1.4085 0.0831 
11 B 0.5848 23 23 0.85 0.7379 —0.1319 
12 A 0.4123 23 24 1.3452 1.4254 0.0596 
12 B 0.5877 25 25 0.85 0.8935 0.0512 
13 B 0.4613 24 24 0.85 1.1942 0.4049 
13 A 0.5387 24 24 0.85 0.9764 0.1487 
14 A 0.4969 25 25 0.85 1.0372 0.2203 
14 B 0.5031 26 25 1.4898 1.5992 0.0734 
15 B 0.5303 26 26 0.85 0.9089 0.0693 
15 A 0.4697 28 29 255 3.3276 0.3049 
16 B 0.508 29 29 0.85 0.9926 0.1678 
Final 
score Bets Payoff 
Team A 29 7.5549  —7.5549 
TeamB 30 7.9532 8.7968 


NOTE: Net betting payoff = 1.2419. Source: Lane and Ziemba (2004). 
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TABLE 9 Exponential Function Favorability, 
g = 0.3; 100 Game Set Hedge Series Construction 


Threshold favorability, ro 


m 0 0.1 0.2 

0 -1.96 2.12 11.29 17.21 
0.05 —0.76 4.51 13.27 19.12 
0.1 0.79 3.77 10.34 17.26 
0.2 2.83 4.74 8.19 14.17 


Source: Lane and Ziemba (2004). 


The algorithm proceeds by requiring that the preset threshold favorability factor 
value is satisfied before the initial bet of any pair is made. Priority over initializing the 
first half of a new risk arbitrage is given to matching unmatched hedges. The automatic 
strategy policy is used for the construction of all pairs. 

Table 8 describes the bets in a typical game where 24% of a betting unit is the 
total profit. A summary of the results of this simulation appears in Table 9. The results 
are encouraging. Depending on the anticipated margin and the threshold value chosen, 
the number of hedges completed in the simulated games can range from 1 to 20 with 
never more than three uncompleted bet pairs among this series. Losses may be incurred 
on any particular game under this imperfect hedge strategy due to the possibility of 
uncompleted bet pairs. Such losses never exceeded one betting unit for any of the sim- 
ulated games, whereas the single game gain ranged as high as three units. Game sets 
are divided approximately 60/40 in terms of winning to losing games for the sets exam- 
ined. The total gain over the entire set of games is increased relative to the single hedge 
strategy and the instance of loss is reduced. As the odds favorability variance increases, 
the potential for more profitable bets occurs and the expected gains and variance rise 
accordingly, positive net gains occur regularly for similar values of the standard devia- 
tion of the odds favorability distribution. The variance of the expected gains are larger 
than for the single hedge case (40 versus 6) as expected. 


4. FINAL REMARKS 


It is possible to construct profitable arbitrage strategies for the 30 point jai alai game. 
Modest returns may be realized under strategies of arbitrage and risk arbitrage sin- 
gle bet pair constructions. Mathematical programming results imply that series of bet 
pairs may be optimal for games of this kind. Simulation results suggest that improved 
gains may be obtained under such strategies where bet placements are dependent on the 
favorability of quoted odds and the score. Further analysis of this situation might con- 
centrate on more detailed investigation into the actual distribution of quoted house odds 
during the game. This will involve more intensive data collection at jai alai frontons. 
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Data collection in Mexico City indicates that there are substantial inefficiencies. The 
formulation here has assumed score invariant single point win probabilities. A more 
refined but possibly unmanageable analysis might consider score variant probabilities 
possibly using Markov chains. More study should also be undertaken with respect to 
the reaction of the oddsmakers to shifts in score and an examination of their individ- 
ual objective functions. Parayre (1986) has some results along these lines for win and 
perfecta bets based on player strength and post positions. These results follow ideas in 
Ziemba and Hausch (1986). 

The methodology and insights found for team jai alai also have potential applications 
in other situations where one has non-marketable financial instruments. These include 
certain horse racing (especially on betting exchanges), currency exposure, and produc- 
tion situations. Risk arbitrage in traded options and warrants markets is an additional 
example. See Shaw et al. (1995) for one such application related to the Japanese Nikkei 
put warrant in 1989-1990. 

In England and other European and Commonwealth countries, legalized bookies set 
odds that various horses will win a given race both on-course and off-course. These odds 
may differ across bookies at a particular moment in time. The odds change during the 
20 or so minutes before a race is run as opinions are altered in light of new information 
such as the horses’ appearances and because the bookies would like to simultaneously 
balance their books to guarantee a profit no matter what horse wins, and maximize 
the number of tickets sold. This situation, from the bettor’s perspective, mirrors the 
jai alai situation, once extended to multiple outcomes, assuming that he has an indepen- 
dent estimate of the probability that each horse will win obtained by a handicapping or 
statistical procedure. 

A key feature of the team jai alai and racing situations is that the tickets once pur- 
chased are not marketable except perhaps at a substantial discount. Other situations 
share these features and we will describe two of them briefly here. 

Consider a company with substantial foreign accounts receivable at a future date. The 
standard way to hedge against possible devaluations is through a futures contract in the 
country’s currency. However, in many cases this is not possible because the currency 
does not have an active futures market or the time horizon is too long. The curren- 
cies of Italy, Thailand, and Turkey are examples of the former. Even for established 
heavily traded currencies such as the Euro and the Mexican Peso, such contracts will 
not cover a multiple year exposure. Negotiations with a bank might produce a spe- 
cial forward contract for part of the exposure. Such a contract would be difficult to 
sell except at a substantial discount. As time goes on, the company may add additional 
contracts to cover more of the exposure with the original or other banks. In terms of the 
jai alai formulation, one may think of the original exposure and any subsequent accounts 
receivable as bets on A and the covering as bets on B. 

Farmers often have fixed contracts for delivery of the crops from their acreage at a 
specified time. Both the price he or she will receive and quantity he or she will have 
available are likely uncertain. In a publicly traded commodity such as corn or wheat 
he or she could hedge against these uncertainties. However, active futures markets are 
not available for most commodities. Lettuce and raspberries are two such examples. 
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The farmer can consider his or her crop as bets on A and contracts he or she makes with 
other farmers of specific quantitites at fixed prices as bets on B. 

Some analyses of problems similar to these two examples using hedging arguments 
for static problems appear in Anderson and Danthine (1981), Feiger and Jacquillat 
(1979), McKinnon (1967), and Rolfo (1980), and for a two period problem in which 
additional information becomes available, see Baesel and Grant (1982). 
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WILLIAM T. ZIEMBA*: The evidence is very strong for the existence of a turn- 
of-the-year effect. The prices of small-capitalized stocks increase significantly 
relative to large-capitalized stocks on the first ten or so trading days of January. 
The effect seems to begin on trading day —1 where there is a tendency for a shift 
in sales of the small stocks at the asked rather than at the bid. Trading days +1 
to +4, on average, show enormous gains in the small stocks over the large stocks 
and this effect continues until mid-January. The total average difference between 
the smallest and largest deciles of stocks from —1 to +9 is on the order of 6 to 10 
percent. Studies supporting this effect use data over extremely long periods. For 
example, the original Rozeff-Kinney study concerned the period 1904-74. The 
period from 1962 to present has been especially thoroughly analyzed. One 
additional useful empirical study not referenced by Ritter is Smidt and Stewart 
[7]. 

Why does this effect occur and with such regularity? Jay Ritter’s excellent 
paper provides us with more insight into the understanding of this phenomenon 
with an analysis of the “parking-the-proceeds” hypothesis. Using a unique data 
set for the fifteen turns of the years from 1971 to 1985, he finds that the buy/ 
sell ratio of individual investors at Merrill Lynch explains 46% of the variance 
of the excess returns of small over big stocks in the first half of January. During 


* Faculty of Commerce and Business Administration, University of British Columbia, and Institute 
of Socio-Economic Planning, University of Tsukuba, Japan. Without implicating him, I would like 
to thank Jay Ritter for comments on an earlier draft of this discussion. 
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this period the small stocks gained 8.17% more than the big stocks on days +1 
to +9 or nearly 1% per day. Figure 1 visually shows the argument in a convincing 
manner. 

The “parking-the-proceeds” argument is based on three suppositions. First, when 
individuals buy stocks, they buy disproportionaly more small stocks than big 
stocks relative to institutional investors. Some supporting evidence is presented, 
and this hypothesis is reasonable since it is these very investors who have loaded 
up the large mutual and pension funds with the vast bulk of the market wealth, 
and these institutions do seem to concentrate on the large-capitalized stocks 
since in many cases they are judged on their performance relative to the S&P 
500. Moreover, the individual investors (especially those well healed enough to 
have accounts at Merrill Lynch) likely buy more small stocks in their personal 
accounts to diversify. Secondly, the price of these small stocks is affected by 
buying pressure—again reasonable as demonstrated by Stoll and Whaley [8] and 
others. Finally, individuals are net buyers in early January because of the proceeds 
remaining from December’s tax-motivated sales, and their purchases are large 
enough to move the market. The arguments that individuals sell in December to 
take losses, that they like to see the money in their account at the end of 
December before repurchasing in January, and that they receive pay and bonuses 
starting on day —1 that can be reinvested appear sound. Ritter’s data indicate 
that relative to the rest of the year individuals are net buyers in January but that 
in absolute terms they have been net sellers for each of the fifteen years in the 
sample. It is likely that the net selling is movement into mutual funds and away 
from individual accounts. Perhaps, as he suggests, in January these individuals 
are switching into small stocks and out of big stocks with capital gains. This 
needs to be checked. There is an analogy of the late-charging horse running 
faster to nip a front runner at the wire when in fact the charger is not accelerating 
at all. The horse is simply declerating at a slower rate. 

Obviously there are a number of factors that lead to the turn-of-the-year effect. 
The studies of Ariel [1] with spot data for 1963-1981 and Sick and Ziemba [6] 
with futures data from 1982-1988 show that throughout the year the small stocks 
outperform the big stocks in the trading interval —1 to +9 and particularly —1 
to +4. See also Keim and Smirlock [4]. The arrival of new money from income, 
bonuses, parked proceeds and other sources seems to fuel the net purchase of 
small stocks relative to big stocks in this period. 

The data also seem to support the hypothesis that the turn-of-the-year effect 
is strongest following a bear market. January 1988 following the October 1987 
crash was certainly a strong example of this. 

Why does the market not respond to this effect to eliminate it as it should in 
an efficient market? In fact, at least for the turn of the year, it does seem to do 
this in the futures markets. Clark and Ziemba [2] have shown that the futures 
price of the Value Line Composite begins to anticipate the turn-of-the-year effect 
around the middle of December. Moreover, the extent of this anticipation is 
capturing more and more of the expected gain of the turn-of-the-year effect as 
information about the effect becomes more widely known partly through trade 
books like Haugen and Lakonishok [3] and Ziemba [11]. The expected gain in 
the futures market of the Value Line Composite over the S&P 500 in the mid- 
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December to mid-January period has averaged about 2 to 3 percent in the past 
10+ years, with about two thirds of this being anticipated. 

I am in agreement with Ritter that institutional factors such as he discussed 
and the flow of funds at the turns of months and particularly at the turn of the 
year are among the main causes of the bidding up of the small stocks at these 
times. It is very useful to have reliable risk-return equilibrium relationships to 
explain asset prices. Even the substantial increase in risk during January found 
by Tinic and West [9] and Ritter and Chopra [5] does not explain the effect. 
The latter paper also concludes that risk is rewarded according to the CAPM 
only in January and then only for small stocks. Otherwise, this popular model 
does not explain the risk-return relationship. 

Theoretical studies such as Williams’ [10] asymmetric-information model, 
using better-informed inside traders and tax sellers that yields a bounce back of 
prices in January once the tax selling is completed and the risk premiums fall 
back in line, will be useful to piece together the missing 54% of the explanation. 
However, it does seem that the turn-of-the-year effect is caused by a number of 
factors coming together at one time and that they are largely institutional. Hence 
it will be hard for any one model to be the sole explanation of the effect. 
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1 


Introduction 


This paper explores an interesting question—whether January equity returns 
forecast the direction of the returns in the rest of the year. Yale Hirsch (1986) 
calls this the January barometer. The supposition is that: 

If the market rises in January, then it will also rise during the rest of the year, 
but if it falls in January, then there will be a decline during the rest of that year. 

Hirsch (1994) in his yearly “Stock Traders Almanac” actually reports results 
on the hypothesis “if January rises then that year rises.” While Hirsch includes 
January in the returns for the year, we believe that January returns should be 
excluded. Therefore, this study uses the italicized definition above, which 
excludes January returns and examines the rest of the year returns separately. 
This methodology provides for a clearer statistical test and, we believe, more 
closely represents the essence of the hypothesis. 

In another paper (Hensel and Ziemba (1995)), we investigated the US case 
more fully. Data on the S&P 500 during the 68-year period 1926-1993 strongly 
support the first part of the hypothesis, especially from 1940-1993. However, 
there was evidence that negative January returns did not have any predictive 
power for returns in the next eleven months. 

In this paper we investigate the January barometer’s predictive power in 
many worldwide equity markets including Australia, Austria, Canada, France, 
Germany, Japan, Switzerland, and the United Kingdom during the period 1970- 
1993. The results indicate that positive Januarys have good predictive power but 
that indices aggregated across various regions, such as Europe or Pacific, have 
stronger predictive power. As in the US, negative Januarys have no predictive 
power. 


2 The US Evidence 


We investigated the January barometer in a related paper using monthly total 
return S&P 500 data from Ibbotson Associates, for the 68-year period: January 
1926 to December 1993.! 

The results in Table 1, especially from 1940-1993, strongly support the first 
part of the January barometer hypothesis, namely that, if the market rises in 
January then it will also rise during the rest of the year. During this latter 54-year 
period (bottom two lines of Table 1), January returns correctly predicted the 
direction of the rest-of-the-year returns 75.9% of the time. When the return in 
January was positive, the rest of the year had positive returns 91.2% of the time. 
However, when January returns were negative, the rest of the year had positive 
returns in only 50.0% of the years. The 91.2% was significantly higher than the 
50.0% at the 1% level with a two-tailed test. 

The barometer predicted worse than chance during the 1926-1939 period 
when positive Januarys resulted in positive rest of the years only 37.5% of the 
time (Table 1) versus 57.1% positive rest of the years during this period. That is, a 
model which always predicted positive rest of the years would have been correct 
over 57% of the time during 1926-1939. By contrast, the January barometer 
correctly predicted positive rest of the years 37.5% of the time—a very poor 
showing. However, the barometer was a good predictor in each of the following 
five decades. These five decades had success rates greater than 80% for positive 
January returns. Two decades, the 1950s and 1970s, had success rates of 100% 
for positive Januarys. 


1 Please see Hensel and Ziemba (1995) for the full results. 
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Period 
1926-1993 


1926-1939 
1940-1949 
1950-1959 
1960-1969 
1970-1979 
1980-1993 


1940-1993 
N (1940-1993) 


2 The US Evidence 


Table 1 


Success of the January Barometer in the US and Average Gains 


and Losses for February through December, 1926-1993 


January Barometer Success Average Feb-Dec Loss Average Feb-Dec Gain 

AllJans. Jan. Up Jan. Down  JanDn,RstYrDn JanUp,Rst¥rDn JanUp,RstYrUp JanDn,RstYrUp 
66.2% 81.0% 42.3% -8.1% -25.4% 18.4% 15.5% 
28.6 37.5 16.7 -5.8 -33.9 30.2 26.2 
80.0 85.7 66.7 -7.2 -15.3 16.4 9.2 
90.0 100.0 66.7 -3.9 0.0 23.0 9.9 
70.0 83.3 50.0 -6.8 -11.2 13.6 11.3 
80.0 100.0 60.0 -15.5 0.0 13.5 12.1 
64.3 88.9 20.0 -0.6 -7.5 17.4 8.9 
75.9% 91.2% 50.0% -8.3% -11.3% 17.2% 10.2% 
54 34 20 10 3 31 10 


Source: Hensel and Ziemba (1995) 


The barometer, when positive, has also been a signal for the magnitude of the 
February to December gains. For example, during 1940-1993 when the January 
barometer was positive and suggested a gain during the rest of the year and there 
actually was a gain in these eleven months, it was significantly higher (at 17.2%) 
than the average return (10.2%) during years when the barometer forecasted a loss 
and there was a gain (bottom of right panel, Table 1). This difference was 
significantly positive with a one tail t test at a significance level of 0.15%. 

When the barometer was negative and suggested a weak rest of the year and 
there actually was a loss during the rest of the year, that loss averaged -8.3%. This 
return was not statistically different from the -11.3% when the barometer was 
positive and the forecast failed (bottom of middle panel, Table 1). Also, during 
this 54-year period, when January was negative the rest of the year was negative 
50.0% of the time. This compares to negative rest of the year returns in 24.1% of 
the years. Thus the barometer predicted more frequent negative returns than 
actually occurred, and was not a useful predictor when there were negative 
Januarys. 

We concluded that since 1940 the January barometer, when positive, has 
provided a statistically significant signal that both the probability that a gain will 
occur in the rest of the year is higher than average, and the size of that gain, if it 
occurs, will be above average. A negative January barometer has provided no 
information concerning the rest-of-the-year’s returns.” 

These results are consistent with the hypothesis, discussed, for example, by 
Schwadel (1988), that the returns in January are dependent upon current 
economic activity such as the Christmas sales period in December. If such 
economic activity is high then January stock prices as well as the rest-of-the- 
year’s prices likely will rise. 


2 The success of the January barometer is not the result of January returns, on average, being larger 
than other months. Over this 68-year period, three other months had average returns greater than 
January. A t test comparing the differences in mean returns, across all months, produced no significant 
results. 
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3 Worldwide Evidence 


To study the January barometer in other worldwide markets and regions we 
utilized the Morgan Stanley Capital International Indices with monthly total 
returns for the twenty-four year period 1970-1993. For individual countries we 
used local currency returns for this research. However, for indices aggregated 
across countries it seemed more appropriate to focus on a single currency; we 
used the US dollar for this purpose. 

We investigated the January barometer’s success in local currency returns for 
the nine individual countries: Australia, Austria, Canada, France, Germany, 
Japan, Switzerland, the United Kingdom plus the United States; and in US dollar 
returns for the four aggregated regions: Europe, Pacific, EAFE and the World 
Indices. 


Table 2 
Success of the January Barometer in Nine Countries 
and Four Aggregated Regions, 1970-1993 


MSCI Country Indices 


in Local Currency All Jans. Jan. Up Jan. Down 
Australia 66.7% 78.6% 50.0% 
Austria 50.0 58.3 41.7 
Canada 58.3 80.0 22.2 
France 54.2 64.7 28.6 
Germany 58.3 63.2 40.0 
Japan 70.8 81.3 50.0 
Switzerland 70.8 76.5 57.1 
United Kingdom 83.3 94.1 57.1 
United States 66.7 92.9 30.0 
MSCI Regional Indices 
in US Dollars 
Europe 79.2% 78.9% 80.0% 
Pacific 75.0 82.4 57.1 
EAFE 70.8 82.4 42.9 
World 70.8 -87.5 37.5 


In general, we have the same conclusion as in the US (with the 1926-1993 
data). If January is positive, then the rest of the year is positive a high percentage 
of the time. For example, this percentage averaged 74.6% for the eight countries, 
not including the US. The percentage was higher for the Europe (78.9%), Pacific 
(82.4%), EAFE (82.4%) and World (87.5%) Indices. Similar to the US, when 
January is negative, the signal that the rest of the year is also negative is very 
weak.? Hence, the results support the predictive power of the January barometer 
for positive Januarys but not for negative Januarys. 

The results in the aggregated regions support this and are somewhat stronger. 
However, there is one notable exception. For the MSCI Europe Index the 
barometer has worked for negative Januarys as well as positive Januarys 80.0% 
and 78.9% of the time, respectively.4 


3 The success percentage averages only 43.3% for the eight countries. A statistical test indicates that 
74.6% is higher than 56.7% {negative Januarys implying positive rest of the years, 56.7% = 
100% - 43.3%), with a two-tailed test at the 5% level. Moreover, 43.3% (negative January barometer 
success) is not statistically higher than 25.4% (positive Januarys implying negative rest of the years). 

The success at predicting negative rest of the years is in contrast to the other aggregated indices 
and the individual countries. This result could be due to the small number of negative Januarys (5), 
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3 Worldwide Evidence 


To investigate the January barometer further, we present in Tables 3a-m (in 
the Appendix) the results for the individual countries and regions. An explanation 
of Table 3 is provided in Table 3a. 

Some of the main conclusions by country and region for 1970-1993 are: 

a) Australia — the barometer worked similar to the US: in 11 of the last 12 
years with positive Januarys, the returns in the rest of the year were also positive; 
negative Januarys had no predictive power; when returns were positive in 
February to December they were higher when January was positive. 

b) Austria — the barometer did predict slightly better than chance for 
positive Januarys and slightly worse than chance for negative Januarys but these 
results were not statistically significant even at the 10% level. 

c) Canada — the barometer did work for positive Januarys and provided no 
information for negative Januarys. The results were similar to those for the US 
which is not surprising because the stock returns in the two markets are highly 
correlated and the currency exchange rate has relatively low volatility. 

d & c) France and Germany — the January barometer predicted poorly in the 
1970s but very well for the positive returns in the 1980s; in neither period did the 
signal predict the level of returns better than chance. 

f) Japan — the results were similar to the US with the barometer predicting 
the probability and size of the rest-of-the-year’s returns with positive Januarys 
and providing no information for negative Januarys. 

g) Switzerland — similar to France and Germany; the barometer predicted 
poorly in the 1970s and more accurately in the 1980s; the signal for positive and 
negative Januarys gave an accurate prediction of the size of the returns in the rest 
of the year. 

h) United Kingdom — for positive Januarys, the barometer had a very high 
level of accuracy (94.1% overall and 100% in the 1982-1993 period) and gave an 
accurate prediction of the level of returns. For negative Januarys, the barometer 
did not provide a useful prediction of the chance of negative returns in the rest of 
the year. However, the barometer did provide a useful prediction of the magnitude 
of the February to December decline. 

i) US — the 1970-1993 results reported here are similar to those from 
1940-1993 reported in Hensel and Ziemba (1995) and summarized in Section 2. 

j) Europe — the MSCI Europe Index has 14 countries which include the 
five studied here plus nine others. However, the five studied here contain the 
majority of the market capitalization and hence dominate the results. The January 
barometer for the aggregated index for Europe was accurate for positive as well as 
negative Januarys (however, there were only 5 negative Januarys), and both gave 
accurate signals regarding the size of the returns. 

k) Pacific — the Pacific index is dominated by Japan and the results show 
this; the barometer predicted well both the probability and size of the rest-of-the- 
year’s returns for positive Januarys and provided no information for negative 
Januarys. 

1) EAFE — the MSCI Europe, Australia, and Far East Index results are 
similar to that of the Pacific with the barometer predicting well both the 
probability and size of the rest-of-the year’s returns for positive Januarys and 
providing no information for negative Januarys. 

m) World — the MSCI World Index is approximately the US plus EAFE 
both of which had the same result; the barometer predicted well both the 
probability and size of the rest-of-the-year’s returns for positive Januarys and 
provided no information for negative Januarys. 


possible effects introduced by measuring returns is US dollars, or through a reduction in the noise of 
each country forecast when they are aggregated into the regional index. 
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4 Final Remarks 


We do not have an adequate explanation as to why the January barometer seems 
to have predictive power when January returns are positive. There has been some 
conjecture that the January barometer might really be a first month of the 
corporate fiscal year phenomena. The most common corporate fiscal year in the 
US is the calendar year. Therefore, in countries where fiscal years and calendar 
years differ, if this conjecture is correct, we might expect to see the first month of 
the common corporate fiscal year predict the following 1! months returns better 
than January. 

We examined this possible explanation for the January barometer for three 
countries—Australia, Japan, and the UK. In Australia, many companies start their 
fiscal year in July, ending the following June. Japan and the UK have fiscal years 
from April through March. All three cases, using the first month of the fiscal year 
as an indicator for the rest of the year, performed worse than using January as the 
indicator. These results suggest, at least for these three countries, that corporate 
fiscal years were not the major factor contributing to the success of the January 
barometer. 

The actual reason why the predictive ability occurs is probably a combination 
of factors. Besides the one discussed above, and the Christmas business 
hypothesis discussed in the text, the very fact that January returns are usually high 
and are expected to be so is another possible reason. That may be why the 
barometer seems to predict well for positive Januarys (the expected result) and 
provide no information for negative Januarys. Still, the January barometer is an 
interesting and useful concept and indicator for stock investors in the US and 
other worldwide markets. 
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6 Appendix 


6 Appendix 


Table 3 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 


a) Australia Index January Barometer 
Success -Success percentage of the January barometer for all 
Period All Jans. Jan. Up Jan. Down] Januarys, Januarys with positive returns, and Januarys 
with negative returns. 
-Frequency of yearly observations. 


1970-1993 66.7% 78.6% 50.0% 
N 24 14 10 : 
-Percentage of years with positive January returns. 
% Years with positive January 383 -Percentage of years with positive returns for the year. 
% Years with positive Returns 66.7 mined o years with positive rest of the year 
% Years with positive Feb-Dec 66.7 retumis (Feb DEC): 
1970-1993 


Average Feb-Dec Loss -These returns provide data on the magnitude of losses, 
JanUp, RstYrDn when there was a loss in the rest of the year (Feb-Dec), 
JanDn, RstYrDn for positive and negative January returns. 


Average Feb-Dec Gain -These returns provide data on the magnitude of gains, 
JanUp, RstYrUp when there was a gain in the rest of the year (Feb-Dec), 
JanDn, RstYrUp for positive and negative January returns. 
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Table 3 (Continued) 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 


b) Austria Index January Barometer Success 


d) France Index January Barometer Success 


Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 

1970-1993 50.0% 58.3% 41.7% 1970-1993 54.2% 64.7% 28.6% 
N 24 12 12 N 24 17 7 
% Years with positive January 50.0 % Years with positive January 70.8 
% Years with positive Returns 62.5 % Years with positive Returns 70.8 
% Years with positive Feb-Dec 58.3 % Ycars with positive Feb-Dec 66.7 
1970-1993 1970-1993 

Average Feb-Dec Loss Average Feb-Dec Gain 
JanUp,RstYrDn -3.1% JanUp,RstYrDn -6.3% 
JanDn,RstYrDn -1.3% JanDn,RstYrDn -3.0% 

Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 15.3% JanUp,RstYrUp 12.9% 


JanDn,RstYrUp 


c) Canada Index January Barometer Success 


10.5% 


JanDn,RstYrUp 


e) Germany Index January Barometer Success 


24.7% 


Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 

1970-1993 58.3% 80.0% 1970-1993 58.3% 63.2% 40.0% 
N 24 15 N 24 19 5 
% Years with positive January 62.5 % Years with positive January 79.2 
% Years with positive Returns 75.0 % Years with positive Returns 70.8 
% Years with positive Feb-Dec 79.2 % Years with positive Feb-Dec 62.5 
1970-1993 1970-1993 

Average Feb-Dec Loss verage Feb-Dec Loss 
JanUp,RstY¥rDn -2.6% JanUp,RstYrDn -3.8% 
JanDn,RstYrDn -1.6% JanDn,RstYrDn -10.4% 

Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 10.8% JanUp,RstYrUp 12.0% 
JanDn,RstYrUp 12.4% JanDn,RstYrUp 16.7% 
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6 Appendix 
Table 3 (Continued) 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 
f) Japan Index January Barometer Success h) United Kingdom Index January Barometer 
Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 
1970-1993 70.8% 81.3% 50.0% 1970-1993 83.3% 94.1% 57.1% 
N 24 16 8 N 24 17 7 
% Years with positive January 66.7 % Years with positive January 70.8 
% Years with positive Returns 75.0 % Years with positive Returns 83.3 
% Years with positive Feb-Dec 70.8 % Years with positive Feb-Dec 79.2 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 
JanUp.RstYrDn -2.8% JanUp,RstYrDn -0.1% 
JanDn,RstYrDn -8.1% JanDn,RstYrDn -12.9% 
Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 16.3% JanUp,RstYrUp 19.0% 
JanDn.RstYrUp 11.5% JanDn,RstYrUp 7.1% 


g) Switzerland Index January Barometer Success i) United States Index January Barometer 
Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 
1970-1993 70.8% 76.5% 57.1% 1970-1993 66.7% 92.9% 30.0% 
N 24 17 7 N 24 14 10 
% Years with positive January 70.8 % Years with positive January 58.3 
% Years with positive Returns 70.8 % Years with positive Returns 79.2 
% Years with positive Feb-Dec 66.7 % Years with positive Feb-Dec 83.3 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 

JanUp,RstYrDn -3.6% JanUp,RstYrDn -0.4% 
JanDn,RstYrDn -8.4% JanDn,Rst¥rDn -4.5% 

Average Feb-Dec Gain Average Feb-Dev Gain 
JanUp,RstYrUp 14.4% JanUp,RstYrUp 14.5% 
JanDn,Rst¥rUp 5.0% JanDn,RstYrUp 7.2% 
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Table 3 (Continued) 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 


J) Europe Index US$ January Barometer Success 1) EAFE Index US$ January Barometer Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan.Up Jan. Down 
1970-1993 79.2% 78.9% 80.0% 1970-1993 710.8% 82.4% 
N 24 19 6 N 24 17 
% Years with positive January 79.2 % Years with positive January 
% Years with positive Returns 70.8 % Years with positive Retums 
% Years with positive Feb-Dec 66.7 % Years with positive Feb-Dec 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 
JanUp,RstYrDn -1.2% JanUp,RstYrDn -2.4% 
JanDn,RstYrDn -7.8% JanDn,RstYrDn -5.3% 
Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 15.8% JanUp,RstYrUp 18.5% 
JanDn,RstYrUp 4.0% JanDn,RstYrUp 6.4% 
k) Pacific Index US$ January Barometer Success m) World Index US$ January Barometer Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 
1970-1993 75.0% 82.4% 57.1% 1970-1993 70.8% 87.5% 37.5% 
N 24 17 7 N 24 16 8 
% Years with positive January 70.8 % Years with positive January 66.7 
% Years with positive Returns 70.8 % Years with positive Returns 75.0 
% Years with positive Feb-Dec 70.8 % Years with positive Feb-Dec 79.2 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 
JanUp,RstYrDn -3.0% JanUp,RstYrDn -2.2% 
JanDn,RstYrDn -7.5% JanDn,RstYrDn -4.2% 
Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 22.6% JanUp,RstYrUp 14.5% 
JanDn,RstYrUp 10.9% JanDn,RstYrUp 5.1% 
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Occupational Nostalgia #3 
Bill Ziemba: 


lhad been focusing my research 
on stochastic programming, that’s 
optimization with scenarios and 
portfolio theory, Earlier | applied 
these ideas to horseracing and 
devised a now well-known sys- 
tem to beat the place and show 
markets, The idea was to use probabilities from a simple 
market, namely win, to price more complex markets like 
place, show, and exotics. And bet with the Kelly crite- 
rion. That’s led to a number of books that the racetrack 
hedge fund syndicates use, See Amazon for titles, | got 
an invitation to go to Japan and after an interview con- 
structed a deal to be the first Yamaichi Visiting Professor 
of Finance and consultant two days of the week to the 
Yamaichi Research Institute. The Leaching was good, 
doing futures and options and other financial subjects. 
The students were from the University with a number 
commuting up to the University of Tsukuba, which is a 
National University. | asked for the consulting to be in 
two study groups on Thursdays and Fridays on stock 
market anomalies end stock market crashes. It was 1988 
and they were flush with money, My wife Sandra, a keen 
economist who knows real economics, believed that as 4 
Creditor nation they were in trouble and would figure out 
a way to lose the money. She was right, and in the end 
they lost 5 trillion on stocks and 5 trillion on land, both 
items they already owned, So they simply pushed up their 
Own assets with only 3 percent of their assets abroad 

~ our book Power Japan details this. In the study groups 
we found a good crash model that predicted 12 out of 12 
of the 10+ percent crashes inthe Japanese market from 
1948-1988. These were interest rate crashes relative to 
esmings yield, so | had the model when the long bond 
fate minus the reciprocal of the price earnings ratio was 
too high. Then in the fight for the asset allocation money 
Went to bands and stocks fell. | called this the bond 


stock crash measure, And | did publish it in 1991 in the 
book Invest Japan, With the Japanese market going up 
221 times in yen and 550 times in dollars in this 40-year 
period there were 20 such 10+ percent declines. So the 
mode! did not predict all crashes but it did work when the 
signal went into the danger zone. The idea was that if it 
entered the danger zone then it was very likely that a 10+ 
percent decline would occur in the next year. | learned 
this by looking at the 1987 worldwide stock market crash 
~ there the market went into the danger zone in April 
then fell in October. So the market can continue rallying 
but the signal indicates that a big decline will occur. The 
measure called nicely the 2000 US crash, with the signal 
showing up in April 1999 a year before the start of the 
decline. Recall there was a drop in April then a rally back 
toa similar high in August 2000 then a decline of big 
proportions into 2001. Then stocks fell but earnings fell 
more in early 2002, so the mode! suggested another crash 
that occurred in 2002 with the market falling 22 percent 
and 12 percent of this in the July to September quarter. 
That's in the USA. | might add that this model predicted 
the 2008 crashes in China and Iceland because they were 
interest rate crashes but it did not predict the 2007-2010 
trouble period which was not a high interest rate crash. 
This measure ina ratio logarithmic form is known in the 
trade as the FED model from a 1996 report. Of course, it’s 
in our 1991 book, so | hope someday to get some credit 
here. Getting back to Japan, | noted at the end of 1989 
that Lhe market was in the danger zone. Getting a close 
prediction ona crash of an expensive overheated market 
is tough. Ask George Soros, who shorted Japan in 1988 
and lost a lot. But then the bond-stock model was not 

in the danger zone as stack prices were high but interest 
fates were not high enough to reach the danger zone. 
But! was not consulting for Soros, rather for Yamaichi 
The firm was nice to me, and my family. My now rather 
famous daughter Rachel, who has become a star analyst 
working with Nouriel Roubini, was nine then and went to 
Japanese public school (in Japanese), Rachel has many 
talents and languages are one of them, so she picked up 


the Japanese fast ~ some from after-school play with 
other youngsters, My wife had a nice teaching job and 
we were taken to nice dinners and other good events. | 
was taken for fugu and golf ~ yes, | was able to come last 
but not much below the 3rd finisher - so the protocol 
was handled okay. Getting back to the crash model - | 
decided to tell them about this through lishi, a rather 
good helper with perfect English (MBA from Yale) and 
perfect Japanese. He had helped me with the calcula - 
tions and wes sort of a de facto leader of the students. 
While | had given a number of talks to large audiences 
and had a whole year of talks in English-Japanese with 
Mr. Okada, one of the higher ups, and they supported 
my research ~ including a rather good 30-factor model 
of the Japanese first section — there was no way they 
would believe me and lishi that the market heading to its 
peak of 38,916 was way into the danger zone. It’s too bad 
they did not listen, as the market fell 56 percent starting 
‘on the first trading day of January 1990. And five years 
later Yamaichi Securities went bankrupt — quite a fall for 
the world’s sixth largest brokerage firm in 1989, There 
was one more interest rate rise in late December 1988 
in the governunent’s futile effort to deal with the bubble 
economy, In their poor response the government raised 
interest rates eight more months into August 1990 - and 
that in my opinion had a lot to do with the 20-year bad 
economy that they have had. | had found that most of the 
anomalies found in US markets were there in Japan, such 
as the turn of the month effect, etc. It was said that Japan 
was different. What was different was not the economics, 
but the culture. My Japan experience was a good one and 
led to three books and a number of articles, a rather good 
Nikkei put warrant arbitrage with fellow Wilmott colum- 
nist Ed Thorp, and a nine-year consulting to the research 
department relationship with the Frank Russell Company 
in Tacoma, Washington including the Yasuda Kasai model 
which | designed for them and which ushered in the era of 
multi-period stochastic programming asset-liability mod- 
els; for interested parties, see especially the 2007 ALM 
handbook | edited for North Halland with Stavros Zenios. 
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U.S. Bears’ Bets May Roil Japan’s Turmoil 


By BARBARA DONNELLY 
And MICHAEL R. SESIT 
Staff Reporters of 

THE WALL STREET JOURNAL 

U.S. investors on Nikkei put 
warrants, a relatively new invest- 
ment that reaps big profits if 
Japanese share prices plunge, have 
been sitting pretty through the Tokyo 
stock market’s recent upheavals. 

They also may be part of what is 
causing that turmoil. 

This is because the securities 
firms and banks that devised the 
Nikkei puts, such as Salomon Inc. 
and Bankers Trust New York Corp., 
often hedge them with complex 
trading strategies that increase 
pressure on Japanese stocks when 
prices are already falling. 

In effect, these institutions have 
revived the computer-driven “port- 
folio insurance” strategies that were 
a primary factor behind the U.S. 
stock market crash in October 1987. 

So far this year, the benchmark 
Nikkei index of 225 leading stocks that 
trade on the Tokyo Stock Exchange 
has plunged nearly 27% from its 
record high at the end of last year. On 
Monday, the Index fell 750.74 points, 
or 2.6%, to close at 28463.18. 

The fundamental reasons for the 
drop in Japanese stock prices are well 


known: rising Japanese interest rates, 
accelerating monetary growth, fears 
of higher inflation and a general belief 
that asset prices in Japan — both stocks 
and real estate — were overvalued. 
Waning confidence in  Japan’s 
monetary authorities has merely 
added to the investors’ anxieties. 

Now a growing number of people 
believe that portfolio insurance also 
played a part. While the Nikkei put 
warrants weren’t the cause of the 
Tokyo stock market’s drop, they say, 
the computerized hedging programs 
backing them exacerbated the decline 
once it started and added to the 
market’s volatility. 

“These puts aren’t taken lightly 
of the Ministry of Finance or any of 
the groups in Japan,” says a Canadian 
investment banker who worked on 
some of Bankers Trust’s warrants. 
“There were a lot of negotiations with 
the ministry. We got calls at two in 
morning. It was very sensitive.” 

In Tokyo, a Finance Ministry 
official says: “We can imagine [the 
puts have] some effect, but no one can 
measure them. We cannot trace how 
these products have affected the 
market. We intend to study it further.” 

Just how much the Nikkei puts 
have affected the Tokyo market is 
probably impossible to qualify. Some 


of the effect is muted by rules that 
require futures trading to close down 
once prices move 10%. 

Nonetheless, “The psychological 
effect is greater than the actual 
transactions,” a senior trader at one 
of Japan’s big securities firms says. 
“The problem is that [the big 
Japanese brokers] can’t do it and 
can’t explain it to their clients.” 

Japanese securities firms, which 
lack the technology and the expertise 
to do much computerized program 
trading, have criticized U.S. firms in 
recent months for disrupting the 
market by using such techniques. 
While the Americans don’t deny they 
use computerized trading strategies, 
they vehemently deny that their 
practices contribute to the Tokyo 
market’s violent fluctuations. 

Four Nikkei put warrants have been 
listed on the American Stock Exchange 
since early January. The first was 
offered by Denmark and underwritten 
by Goldman, Sachs & Co.; the others, 
by Bankers Trust with one and Salomon 
with two, quickly followed. 

The  Annex-listed warrants 
entitle investors to a payment in U.S. 
dollars, if the Nikkei stock average 
falls below the exercise or “strike” 
price specified in the war- 
Please Turn to Page C10, Column 5 


Bearish Bettors in U.S. May Be Partly 
Behind Upheavals in Tokyo 


Continued From Page C1 
rants’ prospectuses. Trading in the 
Annex puts has been active, at times 
accounting for as much as 40% of 
the Annex’s daily volume. 

Other Nikkei puts have been 
trading in Canada and London for 
more than a year, while private 
placements of Nikkei put options 
have been available to U.S. and other 
institutional investors since 1988. 

Combined, the publicly listed U.S. 
and Canadian put warrants represent a 


Source: The Wall Street Journal, 1990 


roughly $2 billion bearish bet against 
the Nikkei index, figures William 
Ziemba, a professor of management 
science at the University of British 
Columbia. Nobody knows for sure the 
value of the privately placed puts, but 
investment bankers involved in some 
private deals say the total could be 
about to times the listed warrants. 

To hedge the exposure on the 
Nikkei puts, the securities firms and 
banks use a few different strategies, 
including portfolio insurance, or 


“dynamic hedging,” as it is called 
now. This isn’t really insurance; 
rather, it is a trading strategy that sells 
progressively more stocks as the 
market declines and buys them back 
as prices rally. The goal is to limit 
losses in a falling market, but not 
leave the investor too far behind when 
prices are gaining. Without these and 
other strategies to hedge the puts, 
Issuers would have risked losing 
hundreds of millions of dollars in the 
recent plunge in Japanese stock prices. 
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NIKKEI PUT OPTIONS GOOD BUY 
FOR FOREIGN FUND MANAGERS 


Why foreign investors are finding Tokyo an attractive place to insure their global portfolios. 


ne of the great ironies in the 
Q esate run-up of 

Japanese stocks is that each 
rise makes it easier for foreign inves- 
tors to bet that the market might one 
day start back down. 

This has been made possible, in 
part, by infinitely bullish Japanese 
players who don’t shy away from 
taking the other side of the bet from 
foreign investors. 

The foreigners are buying put op- 
tions to sell the Nikkei 225 stock 
average. A put is an option to sell ata 
specified price within a specified 
period. Buyers pay a fixed premium 
and profit by each dollar that the Nik- 
kei falls below a certain level, or strike 
price. If the index hasn't fallen to the 
strike price when the option expires, 
the premium is forfeited. 

The draw for large investors is two- 
fold. First, the puts provide a cheap 
way to bet against the Tokyo market. 

For about 6 percent of the Nikkei, 
or close to ¥2,000, one can buya three- 
year option with a stock price equal to 
the index's current level. If the index 
falls just six percent the investor 
breaks even. 

Second, they are less risky than a 
second way of betting against the 
market called short selling; selling 
borrowed securities into the marketin 
the hopes of a price drop. 

One reason for the puts’ low cost is 
the large number of Japanese cor- 
porations and individuals willing to 
take the other side of the trade. 

According to Koichi Kozu, assistant 
manager for stock index futures and 
options with Nomura Securities Co., 
the largest buyers of call options are 
Japanese trading houses. Japanese 
corporations and private individuals 
are also large buyers. 

There are several ways, other than 
bullish sentiment, to explain the 
seemingly unlimited Japanese ap- 
petite for calls. 

Many Japanese companies, includ- 
ing major securities houses, book the 
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premium, now about 20 percent of 
the option’s exercise price, as profit. 
Very few investors will miss an easy op- 
portunity to increase yield on their 
portfolios, according to Kozu. 

In addition, David Baran, vice presi- 
dent of derivative products with 
Shearson Lehman Hutton, said the 
Japanese investors, including major 
securities houses, take the premiums 
and plow them back into the market. 

“They (Japanese investors) are long 
and, they're going to be long forever,” 
Baran quipped. 

Another attractive feature of the op- 
dons market is its relatively low mar- 
gin requirements. With margin 
requirements in many foreign 
markets rising in response to concern 
over the Crash of 1987, Tokyo remains 
a safe port for leverage. 

A final explanation is that the level 
of volatility in New York is nearly 
double that in Tokyo. In volatile 
markets, options sellers demand 
higher premiums to compensate for 
the increased risk of being forced to 
sell their stock or compelled to pur- 
chase additional shares. 

The biggest foreign buyers have so 
far been money managers and high 
net worth individuals. The put op- 
tions are only sold in private place- 
ments and large dollar amounts. 


The money managers say the puts 
are a cheap hedge for global 
portfolios. If the Japanese market 
sneezes and the rest of the world 
catches cold, these money managers 
will have their bases covered. 

“It's a good, cheap way of insuring 
our portfolio,” said Tiger Fund's 
Julian Robertson (see interview). 

Moreover, since losses on options 
are limited, unreformed Tokyo bears 
can wait for the bubble to burst from 
a safe distance. 

The second, riskier way of betting 
against the Tokyo market, selling 
stock short, cannot be done in Tokyo, 
because the Tokyo Stock Exchange 
charges exorbitandy high fees for 
loaning securities to member firms to 
discourage the practice. 

Most of this business is done from 
the London subsidiaries of foreign in- 
vestment banks. Shearson's book of 
Japanese equities loaned is “huge,” 
Baran said. This indicates that there 
are still many people around the 
world who believe the Nikkei is due 
for a fall. 

When asked if he thought Japanese 
investors took part in the short selling 
to hedge their portfolios, Baran said 
he doubted it. 

On the contrary, Baran said a 
favorite trading technique among 
Japanese investors is so-called front- 
running, in which an investor uses fu- 
tures contracts to purchase stocks with 
money he does not yet possess. 

A big question for a buyer of a put 
is what happens if your worst fears are 
realized and the Nikkei does collapse. 

First there is the risk that the yen will 
collapse with it. Another risk is that in 
a general market collapse there is no 
guarantee that the Japanese counter- 
party will be able to deliver on the 
trade. 

But then as one American banker 
said to me, if Nomura goes under, 
we'll have a lot more to worry about 
than options contracts. 

(Stephen Lukow) 
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Buying Stock? Consider Turn-of-the-Month Effect 


HEARD 
ON THE 


STREET 
By Michael Gonzalez 
Staff Reporters of 
THE WALL STREET JOURNAL 

You’ve heard of the “October effect,” 
when the stock market tends to go down. 
And of course there is the “January effect,” 
when it tends to go up. Not to mention the 
axiom: “Buy on Friday and sell on Monday” 
(or is it the other way around?). 

Now Frank Russell Co., a pension-fund 
consulting firm in Tacoma, Wash., is 
advancing the “TOM effect,” short for the 
turn-of-the-month, when Russell says stoc! 
prices tend to go up because that is when 
investors receive cash to put into the market. 

The idea isn’t completely original. 
Earlier studies by analysts Norman Fosback, 
Yale Hirsch, and Robert Ariel have come uj 
with similar results. But the new study, by 
Russell analyst Chris R. Hensel and William 
T. Ziemba of the University of Britis! 
Columbia, is one of the most extensive, 
covering the Standard & Poor’s 500-stoc! 
index and its predecessors from 1928 to 1993. 

Messrs. Hensel and Ziemba found that 
the turn-of-the-month period, defined as the 


As the Month Turns, the DJIA Exceis 
Average percentage price change in the DJIA per day for every month, 1915-1994 
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Source: Girinyl Associates —— $ 


last trading day of the month through the 
first four trading days of the next month, 
generated an average daily return six times 
that of the daily average for the rest of the 
days of the month. 

“It appears that this happens because of 
the flow of funds,” says Mr. Hensel. 
“There’s a number of things that happen at 
the end of the month — people get paid, 
dividends are paid, principal payments,” he 
says, as well as corporate contributions to 
pension or retirement funds. 

The Russell study found average daily 
returns for the turn-of-the-month period were 
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0.1236%, as opposed to an average daily 
return of 0.0186% during the 65-year test 
period. The probability of that happening, 
say the authors, is below one in a thousand, 
making the phenomenon statistically 
significant. 

The study follows daily closing 
prices on the Standard & Poor’s 500-stock 
index 1957, when the 
value-weighted index grew to 500 stocks, 
and on its predecessor, the S&P 90-stock 
composite, since 1928. 


since 


Mr. Hensel says psychology plays a 
Please Turn to Page C2, Column 3 


Turn, Turn, Turn: To Every Stock Price There 
Is a Reason for the Month-to-Month Price Jump 


HEARD 
ON THE 


STREET 
Continued from Page C1 
part, too. “People tend to aggregate things 


according to the calendar,” he says. “People 
observe what has happened to a stock during 
a month and then make decisions whether to 
invest or not.” 

Birinyi Associates, a Greenwich, Conn., 
firm that monitors stock movements, finds 
similar results for the period 1915 to 1994. 
The average daily price change in the Dow 
Jones Industrial Average surges just before 
month-end, when it jumps from minus 
0.02% on day 28 to plus 0.01 on day 29, 
0.12% on day 30, peaking at 0.19% on day 
two of the next month, then falling to minus 
0.01% on day seven. The biggest loser, on 
average, is day 19. 

One potential weakness in both the 
Russell and Birinyi data is that they 
represent price change only, and don’t count 
the dividends 
indexes would receive. One Birinyi analyst 
said that counting the dividends daily, which 


that an investor in such 


only became possible within the past few 
years, the 
turn-of-the-month effect. 


wouldn’t cancel out 


Source: The Wall Street Journal, 1995 


The Russell study found evidence of 
other “effects” known to Wall Street, such as 
the January effect, which analysts have long 
attributed to the halt in year-end selling for 
tax reasons by individual investors as well as 
of new money from corporate 
pension and retirement funds. 


inflows 


The October effect — when stocks go 
down — also turned up in the Russell study, 
with the October average daily return during 
the TOM period the lowest of any month. 
Thomas M. Keresey, chairman of Palm 
Beach Investment Advisers in Palm Beach, 
Fla., says one explanation for the October 
effect is that money managers tend to sell 
stocks during that month to raise cash before 
the end of their fiscal year, typically Oct. 31, 
in order to make once-a-year dividend 
payouts. 

But Mr. Keresey says he ignores these 
kinds of effects. “It doesn’t make any 
difference to me. When I buy a stock I buy it 
because it’s going to go to a certain price 
from a long term perspective,” he says. 

Several traders and money managers 
say they have heard of the TOM effect, but 
note that it is difficult to take advantage of it. 
“You can’t predict what’s going to happen to 
a particular stock,” says Anthony Conroy, 


vice president of global investment 
management at Bankers Trust. 

The Russell study’s authors take it a bit 
further and suggest that traders attempt to 
move into stocks or stock futures just before 
the turn of the month, and back into cash 
However, as the study 


acknowledges, suggestion doesn’t 


afterward. 
their 
include the impact of transaction costs, 
which would likely be considerable. 

Mr. Hensel even tried doing something 
similar — shifting from stocks to cash and 
back — with the S&P index funds and 
money-market funds at the Vanguard mutual 
fund group. But only until the Valley Forge, 
Pa., large mutual-fund family caught on. “I 
was overtrading according to their rules, so 
they basically asked me to stop,” he says. 

However, he says that the average 
investor can still time a monthly mutual-fund 
contribution check, putting it in the mail later 
in the month, say around the 20th, so it will 
go into the fund just before stock price 
returns historically turn up. 

But Jeffrey Rubin, an analyst with Birinyi 
Associates, is sceptical of doing even that much. 
The phenomenon is “obviously true,” he says. 
But he adds: “This is more for the short-term 
trader that wants to have an edge on the market.” 
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MAKING DOLLAR-COST AVERAGING EVEN MORE 
PROFITABLE 


THE TRIED-AND-TRUE DISCIPLINE CALLED DOLLAR-COST AVER- 
AGING works because it is so simple: Anyone who puts a fixed dollar amount 
into stocks or mutual funds at regular intervals will buy more shares when the 
price per share is lower and buy fewer when the price per share is higher, lead- 
ing to a lower overall cost per share. But recent studies show that it pays to 
add just a little complexity. Making those regular purchases at the right time 
of the month can make dollar-cost averaging work even better. 

According to Yale Hirsch, editor of The Stock Trader’s Almanac, over the past 
six decades stocks have done 2.5 times better in the first half of the month 
than in the second half. A 1996 study by Chris Hensel, client executive at the 
Frank Russell Company, and William Ziemba, a professor at the University of 
British Columbia, found an even bigger difference, with average returns of 0.07 
percent in the first half of the month and negative 0.02 percent in the second 
half. 

According to Hensel and Ziemba, the biggest returns come between the last 


trading day of one month and the fourth trading day of the following month. 


Daily returns then average 0.12 percent versus 0.02 percent daily for the entire 
month. 

Taking advantage of this “turn of the month” effect, Hensel says, by investing 
on the third- or second-to-last day of the month. For example, a dollar-cost 
averager would time his check to arrive by September 27. His dollars would 
then be invested when shares are relatively cheap and before prices climb. 
“The turn of the month is when the lion’s share of the gains has happened over 
time,” says Hensel, “so it makes sense for investors to try to be invested by 
then.” 


—J.B. 
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Investment Results from Exploiting Turn-of-the-Month Effects 


1 Introduction 


Investment advisors have argued that US stocks have substantial rises at the turn 
of the month.’ The turn of the month (called TOM) is defined to be the last 
trading day of the previous month and the first four trading days of the new 
month. In this Commentary, we examine the monthly return patterns in the S&P 
500 from 1928 to 1993 to investigate this phenomenon. We then examine various 
investment strategies that could practically exploit the turn-of-the-month effect. 
The results are of potential use to institutional investors concerned with timing of 
stock purchases and sales and for long-term investors who utilize the S&P 500, or 
similar benchmarks, for performance measurement of portfolios. 

The intense investigation of seasonal anomalies in US and other security 
markets brings up the issue of data mining.’ Merton (1985), Black (1986, 1992), 
and Lo and MacKinlay (1990) in particular discuss the dangers of finding what 
appear to be genuine anomalies but may simply be random data variations. 
Perhaps the best remedy against data snooping is new data and convincing 
reasons for the effects. 

One explanation for the high returns at the turn of the month is the 
considerable cash flows coming into the stock market during this time.’ Many 
salaries, dividends, principal payments, and debt interest are payable on the last 
and first days of the month. There are also institutional, corporate, and pension 
fund purchases during the turn of the month. These cash flows vary by month and 
lead to higher average returns in January, which has the highest cash flow.’ 

The data used in this study consists of the daily closing prices of the S&P 500 
stock index for the 65-year period from February 1928 to June 1993, and was 
supplied by Data Resources Incorporated. The S&P 500 is a value-weighted index 
of large capitalization US stocks. Since March 1957, it has consisted of 500 large 
stocks weighted by market value (price times number of shares outstanding). 
Prior to then it consisted of 90 large stocks. Index futures contracts on the S&P 
500 have been traded since 1982 on the Chicago Mercantile Exchange. Futures 
options on the S&P 500 are traded at the Chicago Mercantile Exchange, and 
options are traded at the Chicago Board Options Exchange. The S&P 500 is an 
appropriate index for studying these anomalies, since it is value-weighted, broadly 
diversified and representative of the market portfolio, and has a very long history. 
Additionally, the S&P 500 is often used in passive portfolio management to 
represent the market and is a common performance benchmark for US stock 
portfolios. 


2 The Return Patterns in the S&P 500 


To avoid data-snooping biases, we have defined the turn of the month as Ariel 
(1987) did in his study of the cash markets from 1963-1981. 


Trading 
Category Acronym Days 
Turn of the month TOM -1 to +4 
Second week SW +5 to +9 
First half of the month FH -1 to +9 
Rest of the month ROM +9 to -2 


Thus, TOM is the last trading day of the previous month and the first four trading 
days of this month. Analyzing the S&P 500 data from 1928 to 1993, we found 
that the five trading days during TOM had very high returns. Three of these five 
days had returns significantly above average. Moreover, all of the rest of the 
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2 The Return Patterns in the S&P 500 


month had returns near or below the average and none of these days had returns 
significantly above average. 


Figure 1 
Average Daily Returns in the S&P 500 Cash Market 
by Trading Day of the Month 
(February 1928 to June 1993) 
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Figure 1 shows the average return pattern by trading day, and average daily 
return. The TOM (days -1 to +4) had mean returns significantly above average. 
However, over the 65-year period studied here, the five-day period -2 to +3 had 
even higher returns. To examine the consistency of returns by day, we calculated 
the mean daily return and standard deviation of daily returns for rolling 60-month 
windows. Examples of two trading days are presented. Figures 2a and 2b show 
the rolling 60-month returns and standard deviations of the contrasting trading 
days -1 and -5. For the high-return day -1, there are consistently positive returns 
with relatively constant standard deviation (after the 1930s). For the low-return 
day -5, there are consistently negative retums and somewhat higher and more 
variable standard deviations. 
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Figure 2a 
Rolling 60-Month Returns and Standard Deviations for Trading Day -1 
(February 1928 to June 1993) 
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Figure 2b 
Rolling 60-Month Returns and Standard Deviations for Trading Day -5 
(February 1928 to June 1993) 
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2 The Return Patterns in the S&P 500 


The average return per day for the 65 years was 0.0186%. The mean returns 
during the turn of the month were over six times as high at 0.1236%, which has a 
t value of 5.94 for the hypothesis that the turn of the month days have returns 
above the mean. The probability of finding a daily mean return of 0.1236%, when 
the true daily mean return is 0.0186%, is less than one in a thousand. The first half 
returns were also significantly above average.’ However, during the rest of the 
month, the returns were significantly below average and also significantly 
negative. 


Table 1 
Average Daily Returns and t values for the S&P 500 
(February 1928 to June 1993) 


Average Daily 
Trading Days Return (%) t value 
All days 0.0186 0.00 
TOM 0.1236 5.94 * 
FH 0.0703 4.13 * 
ROM -0.0235 -3.71 * 


* Asterisks denote t values for returns that are significantly 
different from the average at the 5% level of significance, 
using a one-tail t test. 


There was also a significant seasonality in the S&P 500 returns. January, 
March, May, and July had mean returns during the turn of the month significantly 
above the overall average (see Table 2). In every month, the mean daily returns 
during TOM were higher than the average daily return. January and July also had 
significantly higher mean daily returns during the first half of the month than the 
average daily return. The mean daily returns in the first half were positive in all 
months except September. The rest of the month had negative mean daily returns 
in all months except January, June, August, and December. The rest of the month 
for September and October—ited in the media as a time for stock market 
losses—did have large negative mean daily returns, but only September was 
significantly below average. May and November also had significantly negative 
mean daily returns during ROM. 
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Table 2 
Average Daily Returns by Month, 
During the Turn, First Half, Rest of the Month, and Whole Month 
(February 1928 to June 1993) 


Average Daily Returns (%) 


TOM FH ROM All Days 
S&P 500 Index -1 to +4 -1 to +9 +10 to -2 -1 to -1 
January 0.2061* 0.1025* 0.0359 0.0651 
February 0.0807 0.0170 -0.0214 -0.0024 
March 0.1876* 0.0768 -0.0212 0.0208 
April 0.0503 0.0566 -0.0169 0.0161 
May 0.1653 * 0.0819 -0.0836* -0.0107 
June 0.1287 0.0669 0.0033 0.0315 
July 0.2258* 0.1697* -0.0050 0.0738* 
August 0.0645 0.0672 0.0129 0.0364 
September 0.0976 -0.0175 -0.0978* -0.0605* 
October 0.0445 0.0632 -0.0787 -0.0178 
November 0.1108 0.1038 -0.0821* 0.0071 
December 0.1217 0.0564 0.0599 0.0584 
All Months 0.1236* 0.0703* -0.0235* 0.0186 
*Asterisks denote returns that are significantly different from the average daily return for all days 


(0.0186%), at the 5% level of significance, using a one-tail t test. 


The evidence is that all of the gains in the S&P 500 from 1928 to 1993 were 
at the turn and during the first half of the month. The returns in the rest of the 
month were nonpositive. We investigated the frequency distribution of the largest 
percentage gains and losses for various times of the month. We found that the 
higher average returns during the FH in the 1928 to 1993 sample were partly 
composed of a higher frequency of very large returns and a lower frequency of 
very low returns. However, the results were not strong enough to have statistical 
significance. Thus, large gains and losses have at most a minor effect on the result 
that all the gains were in the FH during 1928 to 1993.* 


3 Investment Strategies 


The previous sections have shown that the TOM and FH have very strong daily 
returns, on average, and these returns are not the result of a few large gains and 
losses. In this section, we compare the return percentages, wealth relatives, and 
correlations of large stocks, small stocks, various types of bonds, cash, and two 
investment strategies. The strategies are to invest in the S&P during either the 
TOM or FH and in cash for the remainder of the month. This allows us to 
compare these investment strategies with other buy and hold investments such as 
S&P 500 index and small cap funds. No transaction costs were included. 
However, institutional investors could use futures to implement these strategies, 
dramatically reducing the impact of transaction costs.” Such an investor could use 
futures to adjust the net long S&P 500 to cash position during periods with low or 
negative average returns. The comparison of different idealized strategies 
provides an estimate of the potential value of investing based on the historical 
evidence of higher mean returns during TOM and FH. 
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3 Investment Strategies 


Table 3 
Returns and Growth for Various Investment and Seasonal Strategies 
(February 1928 to June 1993) 


Investment Monthly Avg. Monthly St. Yearly Avg. Yearly St. Growth of $1 
Strategy Return (%) Deviation Return(%) Deviation Investment 
Large Cap 0.79 5.80 9.50 20.11 439.13 
Small Cap 0.96 8.71 11.53 30.18 1,483.63 
TOM + .8 Cash 0.84 2.54 10.13 8.79 758.36 
FH + .6 Cash 0.92 3.64 11.06 12.62 1,290.97 


Table 3 compares Large Cap and Small Cap returns with the returns from two 
investment strategies. Ibbotson Associates supplied the total return data for the 
large capitalization (S&P 500) and small capitalization (bottom 20% of 
companies, capitalization weighted). The final two strategies (TOM + 0.8Cash 
and FH + 0.6Cash) invest in the S&P 500 during the turn of the month or the first 
half and then in cash for the remainder of the month. It is assumed that these 
strategies are in cash 80% and 60% of the month, respectively. These two 
strategies do not include dividends for the periods that they are invested in the 
stock market. Hence their returns are a conservative estimate of the true total 
returns. 

The strategies of being long in the S&P 500 during TOM or the FH and then 
in cash had consistently high mean returns that were mean-variance superior to 
the buy-and-hold Large Cap strategy. That is, the TOM and FH strategies had 
higher returns and lower standard deviations than an investment in the S&P 500 
for the entire period. 

The growth of $1 invested in the various investment strategies for the entire 
1928-1993 period is shown in the last column of Table 3. The results show the 
growth of one dollar over the 65 years of the sample. In interpreting these results, 
one must consider the number of trading days in each period; TOM had five days, 
FH had 10 days, and the whole month had about 20 to 22 days. The total growth 
in the small capitalization index, $1,483.63, was higher than the TOM-plus-cash 
and FH-plus-cash strategies that returned $758.36 and $1,290.97, respectively. 
The TOM and FH strategies dominated Large Cap stocks. Adjusting for risk via 
standard deviation, points to the superiority (high returns and relatively low 
standard deviations) of the TOM and FH strategies during the 65-year sample 
period. 

Next we calculated the correlations of these strategies with other investments 
to see if the correlations make these strategies more or less attractive to 
institutional investors. If the correlations are relatively low with their other 
investments, these strategies can provide additional diversification when added to 
their current holdings. Ibbotson Associates total return data was used for large and 
small stocks, high-yield corporate bonds, long-term (twenty years) and 
intermediate-term (five years) government bonds, and cash measured by the 90- 
day T-bill return. 

Table 4 displays the correlations between the various investment returns from 
February 1928 to June 1993. The TOM- and FH-plus-cash strategies have a 
relatively low correlation with large and small capitalization stocks—namely, 
0.46 and 0.38 for TOM and 0.67 and 0.57 for the FH, respectively.” These 
correlations are of a level that would provide attractive diversification benefits 
(reduced risk) when included in a traditional portfolio of stocks and bonds. Hence 
these strategies may be considered as separate asset classes in mean-variance or 
related asset allocation studies just as smal] or large capitalization or foreign 
stocks might be. 
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Table 4 
Correlations for Various Investment and Seasonal Strategies 
(February 1928 to June 1993) 


Small |Corporate|Long Govt} Intermed. 
Govt Bond} Cash 


Correlations +.8 Cash|+.6 Cash 


Large Cap 
Small Cap 
Corporate Bond 0.22 
Long Govt Bond 0.17 
Interm. Govt Bond 


Cash 0.00 
TOM+.8Cash | 0.46 1.00 
FH + .6 Cash 0.67 0.69 | 1.00 


4 Concluding Remarks 


There was a substantial turn-of-the-month effect in US large capitalization stock 
prices as measured by the S&P 500 during the 65-year period 1928-1993. The 
results show that the mean returns in the stock market were significantly positive 
in the turn and first half of the month and significantly negative in the rest of the 
month. 

The cumulative wealth effects of investment during various time periods 
magnify the effects. The results indicate that the total return from the S&P 500 
over this 65-year period was mostly received during the turn of the month. The 
strategy of being long the S&P 500 during the TOM or the FH and T-bills 
otherwise had very high total returns (exceeded only by small stocks); and when 
risk is considered, dominated all the strategies considered, including the small 
stocks. 

The results point to an advantage from investments in TOM and the FH, as 
opposed to ROM, on average. However, there may be different results in a 
particular year depending upon economic fundamentals and economic news 
shocks. 
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6 Endnotes 


" See Merrill (1966), Fosback (1976), and Hirsch (1986). 


* Recent literature surveys of trading anomalies appear in Hawawini and Keim 
(1995) and Ziemba (1994). 


° Ogden (1987, 1990) discusses empirical support for the flow of funds into the 
stock market hypothesis. Cadsby and Ratner (1991) provide further support for 
the turn of the month cash flow hypothesis using data from Canada, the United 
Kingdom, Australia, Switzerland, and West Germany. Cadsby and Ratner found a 
significant turn-of-the-month effect on trading days -1 to +4 in these countries. 
Ziemba (1991) studied the turn-of-the-month effect and the cash flow hypothesis 
in Japan. 


* The following explanations for the turn of the month have also been advanced: 
behavioral (Penman 1987); inventory adjustments of different traders (Rock 1989 
and Ritter 1988); the timing of trades by informed and uninformed traders 
(Admati and Pfleiderer 1988); specialists’ strategies in response to informed 
traders (Admati and Pfleiderer 1989); seasonal tax-induced trading (Lakonishok 
and Smidt 1986); and window dressing induced by periodic evaluation of 
portfolio managers (Haugen and Lakonishok 1988 and Ritter and Chopra 1989), 


* Ariel (1987) has documented the TOM effect for small- and large-capitalization 
stocks for the 19 years from 1963-1981. His data consisted of the equal- and 
value-weighted indexes of all NYSE stocks from the Center for Research on 
Security Prices (CRSP) tape. The first half of the month had all the gains. ROM 
had negative returns. Hence, investment in the first half of the month provided 
more than all the year’s stock market gains. Some additional support for Ariel’s 
findings for the period from 1982-1988 appears in Cinar and Vu (1991), Keim 
and Smirlock (1987), and Linn and Lockwood (1988). Lakonishok and Smidt 
(1988) investigated daily returns in the Dow Jones Industrial Average from 1897 
to 1986. Using this price weighted average of 30 stocks, they also found high 
returns at the turn of the month. 


ê The shifting of higher mean return to days -2 to +3 from -1 to +4 may be related 
to futures anticipation of this effect. Hensel, Sick and Ziemba (1994) using data 
from 1982-1992 found that the TOM effect is anticipated to some extent in the 
futures on days -4 to -2. 


’ We also examined the TOM, FH and ROM returns by decades from the 1930s 
through the 1980s. Although there was some variability across decades, there 
were consistently high returns during TOM and the FH in all the decades. 


* This result is consistent with the hypothesis that large gains and losses are the 
result of financial news information shocks that occur randomly in time; and they 
are distributed in TOM, the FH and ROM in proportion to the relative number of 
days in these trading periods. 


° Anticipation by the futures market of these stock market gains would need to be 
considered using results such as those in Hensel, Sick and Ziemba (1994). 


"° These correlations can also be estimated theoretically. Suppose the value of the 
small capitalization portfolio and S&P 500 portfolio follow a joint geometric 
random walk with a constant drift, correlation, and variances throughout the 
month. Also suppose that investing in cash yields no return variance. That is, 
interest rates are constant. Let f denote the fraction of days of the month for 
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which the trading is invested in stock. For the TOM-plus-cash strategy, f is 
approximately 0.2, and for the FH-plus-cash strategy, f is approximately 0.4. Let 
the correlation between the returns of the fully invested S&P 500 portfolio and 
either the small capitalization or large capitalization portfolio be p. Then the 
correlation between the returns of the TOM- and FH-plus-cash trading strategies 
with the returns of S&P portfolio or the small capitalization portfolio equals vf p. 
The correlations of the fully invested portfolio with the S&P 500 portfolio is 

p = 1, so we would expect the TOM-plus-cash and FH-plus-cash strategies to 
have correlations of approximately J0.2 = 0.45 and /0.4 = 0.63 with the large- 
capitalization stocks. This is consistent with the data, suggesting that the variance 
of the S&P 500 portfolio is the same during the TOM or FH as it is throughout the 
rest of the month. Similarly, by Table 4, the correlation between the S&P 500 and 
the small capitalization portfolio is p = 0.86 over the whole month. This implies 
correlations between the small capitalization portfolio and the TOM-plus-cash and 
FH-plus-cash strategies of 0.39 and 0.54, respectively. These are consistent with 
the empirical evidence. However, the theoretical correlations are slightly less than 
the empirical correlations for the TOM-plus-cash strategies, which suggests that 
the variance rate of the S&P 500 portfolio increases slightly during the turn of the 
month. 
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Summary of 

"Playing the Turn of the Year Effect with Index Futures” 
by Ross Clark and William T. Ziemba, 

November-December issue of Operations Research 


For twenty years since Gene Fama’s doctoral dissertation at the 
University of Chicago in the early 1960's, the academic financial community 
considered the case for the efficient market hypothesis air tight. Current 
prices of stock, bonds, treasury bills, and other investments were the right 
prices; they fully reflected all the publicly available information concerning 
their value; the best estimate of their future value was the current price. 
You would do just as well throwing darts at the stock pages as hiring a 
professional portfolio manager. Indeed when three executives of Forbes 
Magazine threw 28 such darts in the summer of 1967 they expected their 
portfolio to more or less match the market indices. By the summer of 1984 
their $28,000 had grown to $132,000, excluding dividends, a 370% gain, which 
was more than ten times the gain of the Dow Jones industrials. Why did they 
do so well? After all, according to the prevailing wisdom supplied by the 
capital asset pricing model, one can only achieve higher mean returns by 
bearing more risk. 

Risk comes in two forms: 1) market risk, measured by movement of the 
portfolio as compared to the market ~ the infamous beta (the covariance of the 
portfolio with the market divided by the variance of the market) =- one cannot 
do anything about this risk; and 2) diversification risk, which for 
independent securities drops off with the reciprical of the number of 
securities. With 28 fairly independent securities the diversification risk is 
small (about 8% of its maximum). The portfolio beta was about par equal to 


the market beta of one. 
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The portfolio did so well because it was the small firm effect in 
disguise. Small capitalization stocks, what you emphasize when you put equal 
dollar amounts on a relatively large number of random securities, simply 
return more on average than large capitalization stocks such as those in the 
Dow. In 1981 Rolf Banz and Marc Reinganum, both Chicago Ph.D. students, 
showed that even when you risk adjust, the small stocks return much more than 
the big stocks. Thus the field of stock market anomalies or, as we now call 
them, empirical regularities gained some respectability. 

Dozens of studies have since documented numerous manifestations of the 
data that seem to violate the efficient market hypothesis: low price 
earnings, Value Line #1's, closed end mutual funds selling at large discounts, 
S&P 500 additions, stocks that receive no dividends and the like seem to have 
excess returns. Purists are still trying with little success to find risk 
measures that bring these violations to the theory back into line. It is a 
rough task - how does one explain that over the past ninety years fully 51% of 
all the gains in the Dow Jones industrials have occured on the 8 to 10 trading 
days each year that preceed the major U.S. holidays? And that the probability 
that the Dow Jones index will fall on a Monday following a negative Friday is 
nearly 75%? 

Perhaps the most pervasive of the anomalies is the small firm effect. 
From 1926-1985 a $1 invested in T-bills grew to $7.47, long term government 
bonds to $11.03, long term corporate bonds to $16.55, common stocks to $279.12 
and small stocks (those New York Stock Exchange securities in the bottom fifth 
in capitalization) to $1,241.24. The small stocks returned nearly five times 
the average stocks and more than ten times the largest stocks. A mutual fund 
that invests precisely in this way, essentially buying and holding the NYSE 
bottom fifth, of which Professor Fama is a major architect, manages about $5 


billion in pension money. 
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In 1983, Donald Keim, another Chicago Ph.D. student, discovered that at 
least half of small stock gains occured in January. Richard Roll at UCLA and 
others then found that most of these January gains occurred on five trading 
days (-1, the last day in December, and +1,...,+4, the first four days in 
January). On these days the small stocks simply leap enormous amounts 
relative to the big stocks, and have consistently done so nearly every year 
for decades. In 1986 Jay Ritter found that during the fourteen years, 
1971-1984, the difference of small stocks versus big stocks over the nine 
trading days (-l to +8) ranged from 3.0 to 41.5% and averaged 9.9%. The risk 
premiums are higher on these days for the small stocks but that alone does not 
explain these extremely high returns. Indeed, recent studies have shown that 
risk as measured by the capital asset pricing model is rewarded by increased 
returns only in January. 

The causes of the turn of the year effect seem to be: (a) tax loss 
selling, (b) renewed buying interest in small stocks in the new year because 
of the availability of excess cash balances that, because of the low trading 
volumes, causes upward price pressure, (c) high transactions costs that 
prevent these patterns from being arbitraged away, (d) portfolio manipulations 
and (e) turn of the month and quarter price rise effects. 

I have been interested in the study of speculative markets, be it 
blackjack, lotto games, horseracing or financial securities. In each case the 
procedure is to (1) identify an edge and (2) to determine a proper investment 
strategy. 

There is strong evidence that small stocks outperform large stocks at the 
turn of the year. Yet the transactions costs eat away most, if not all, of 
the potential gains. The transactions costs on index futures is a tenth or 


less of the corresponding basket of securities. Hence, a strategy that should 
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be very profitable is to hold long positions in a small stock index and short 


positions in large stock indices. 


Stock index futures began trading in the U.S. in 1982 and the number of 
different contracts available and their volume has been increasing steadily. 
The ideal way to play the turn of the year effect is to be long the smallest 
stocks and short the largest stocks in liquid index contracts. Our experience 
is with the spread between the Value Line and S&P indices called the VL/S&P 
spread. This is not an ideal way to play the effect, but it is the best we 
have found so far and it has been successful. The Value Line index is an 
equally weighted geometric weighted average of the prices of nearly 1,700 
securities with futures traded on the Kansas City Board of Trade and futures 
options traded on the Philadelphia Stock Exchange. IBM and the smallest 
company in the index are treated equally in the weighting. The Value Line 
index has a downward drift of about 5.5% per year relative to the component 
securities in the index because of the geometric averaging, due to the 
geometric-arithmetic averaging inequality. The S&P 500 futures contract 
is traded on the Chicago Mercantile Exchange. It is value weighted. Hence, 
IBM and the other large stocks count much more than the medium size stocks at 
the bottom of the index. Hence, in a crude fashion, the VL/S&P spread gives 
you the small stocks long and the big stocks short. The bigger the stocks 
are, the more short they are. However, all the small stocks and medium stocks 
are held in the same proportion. 

When should you trade? We have found that the rule: buy the spread on 
the first closing uptick, starting on December 15 and definitely by the 17th, 
and sell on January 15, has worked well. Waiting until (-1) now seems to be 
too late: possibly sufficient finance professors and their colleagues, and 


other students of the turn of the year/January effect move the VL index. 
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There seems to be a bidding up of the March VL future price relative to the 
Spot price. By January 15th, the biggest gains are over and the risks 
increase. During the ten years 1977-1986, the spread dropped 0.92 points, on 
average, with a high variance. The projected gain from a successful trade is 
0-5 points ($500 per point), and averages 2.85 points or about $1350 per 
spread. On average, the December 15 to (-1) day gain on the spread, is 0.57 
points. However, it was 1.05 in 1985 and 3.15 in 1986 which may reflect the 
fact that with the thin trading in the VL index. The market can be moved with 
a reasonably small number of players, who are learning about the success of 
this trade, i.e. the basis was bid up anticipating the January move. Clark 
and I made this trade successfully in 1984/85, 1985/86 and 1986/87. For our 
1986/87 play we attempted to optimize our investment using modifications of 
the capital growth or Kelly criterion which seems to have the most desirable 
properties for repeated investments over time. The procedure allows one to 
balance expected gain with probability of success and leads to a prescription 
of the optimal number of contracts to hold consistent with one's risk-reward 
preferences. 

We employed the concepts in a $100,000 speculative account for a client 
of CARI Ltd., a Canadian investment management company. We decided to 
purchase five VL/S&P spreads to approximate a slightly less than 25% 
fractional Kelly strategy. Watching the market carefully, we bought these on 
December 17, 1986 at a spread of -22.18 which was very close to the minimum 
that the spread traded. On the 15th the spread closed at -20.90 and on the 
16th at -22. The spread increased in value to -18.15 at the end of the year 
in a flat and declining stock market for a gain of 3.85. In January the stock 
market took off in an impressive style with ten consecutive up days. The 


Spread continued to gain and we cashed out at -16.47 on the 14th for a total 
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gain of 5.55 points per contract or a total gain of $14,278.50 after 
transactions costs. The spread began to drop sharply on the 15th and the drop 
escalated into a rout of the small stocks in comparison with the big stocks in 
the S&P 500. At the end of January the index stood at -31.45. The trade must 
be handled carefully. More or less the December 15th to January 15th period 
seems best. More and more players move the market in the December 15-31 
period and, as in 1985/86, most of the gains occured then. 

Will the stock market crash of October 19, 1987 or the tax changes affect 


the turn of the year effect? We think not but only time will tell. 


William T. Ziemba 


Alumni Professor of 
Management Science 


University of British Columbia 


Dr. Ziemba is currently working on a book tentatively titled "Strategies for 
Making and Keeping Excess Profits in the Stock Market" and he is the co-author 


of "Dr. Z's Beat the Racetrack," Morrow, 1987. 
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‘Political’ investments turn out to be lucrative 


By Greg Heberlein 
Seattle Times Business Reporter 

Investing may be simpler than 
it seems, a research study from 
the world’s most prominent asset 
consultant says: 

Buy stocks of smaller 
companies when Democrats 
control the White House, buy 
bonds when Republicans are in 
command. 

The research report was 
issued today by the Frank Russell 
Co. of Tacoma. The company is 
the premier consultant to pension 
and investment groups. Much of 
Russell’s work centers on 
evaluating adviser performance. 

The research comes with no 
guarantee future results will 
match past returns. But it notes 
that because all stocks tend to do 
better in the second half of a 
president’s term, and because 


President Clinton is a Democrat, 
small-stock performance could be 
vigorous. 

“If past trends hold true, 
small-stock returns could 
increase in the second half of 
Clinton’s term,” said Chris 
Hensel, Frank Russell research 
analyst. “This research 
demonstrates that investing in 
specific categories of stocks and 
bonds according to the party in 
the White House has meant 
dramatically higher returns.” 

How much higher? Much, 
much higher. 

According to a study that 
Hensel co-authored with William 
Ziemba of the University of 
British Columbia, small-company 
stocks performed 10 times better 
under Democrats: 20.5 percent a 
year vs. 1.9 percent a year under 
Republicans in the 1929-1992 


period — 63 years. Larger stocks 
had statistically identical returns. 

But bonds boomed under 
the GOP. The Russell study 
showed that long-term 
corporate bonds, long-term 
government bonds, inter- 
mediate government bonds and 
cash outdistanced stocks on an 
annual basis by 4.9, 5.5, 4.6, 
and 2.9 percent, respectively. 

Already during the Clinton 
term, small-company stocks 
including dividends have risen 29 
percent vs. 11.7 percent for larger 
companies. 

Hensel and Ziemba’s study is 
called U.S. Investment Returns 
During Democratic and 
Republican Administrations, 
1928-1993. 

The Russell 
calculate two 

PLEASE SEE Stock Study ON E5 


company 
in- 


Mixing politics with investment decisions 


proves to be lucrative 


Stock Study 
CONTINUED FROM E 1 
vestment strategies that would 
have maximized returns off this 

theory. 

From January 1942 to 
December 1993, $1,000 invested 
in small-company stocks during 
Democratic terms and in large 
stocks during Republican terms 


Source: Seattle Times, 1994 


would be worth $2.7 million 
today vs. $1.1 million if invested 
only in small-company stocks 
and $289,000 if invested only in 
large-company stocks. 

During the same time period, 
$1,000 invested in 
small-company stocks during 
Democratic administrations and 
intermediate-term government 


bonds during 
administrations would have 
returned $1.2 million vs. 
$103,000 from the traditional 
60/40 mix of larger stocks and 
intermediate bonds. 

Russell defines the largest 
1,000 stocks as large-company 
stocks and the next 2,000 stocks 
as small-company stocks. 


Republican 
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This paper discusses the Nikkei put warrant market in Toronto and New York during 1989-1990. Three 
classes of long term American puts were traded which when evaluated in yen are ordinary, product and 
exchange asset puts, respectively. Type I do not involve exchange rates for yen investors. Type II, called 
quantos, fix in advance the exchange rate to be used on expiry in the home currency. Type IH evaluate the 
strike and spot prices of the Nikkei Stock Average in the home currency rather than in yen. For typically 
observed parameters, type I are theoretically more valuable than type II which in turn are more valuable than 
type III. In late 1989 and early 1990 there were significant departures from fair values in various markets. This 
was a market with a set of complex financial instruments that even sophisticated investors needed time to learn 
about to price properly. Investors in Canada were willing to buy puts at far more than fair value based on 
historical volatility. In addition, US investors overpriced type II puts fixed in dollars rather than the type 
Ps in yen. This led to cross border and US traded (on the same exchange) low risk hedges. The market’s 
convergence to efficiency (that is, all puts priced within transaction cost bands) took about one month after 
the introduction of the US puts in early 1990 leading to significant profits for the hedgers. 


Keywords: option mispricing, cross-border trading, Nikkei stock exchange 


1. Introduction 


The Japanese stock market rivals that of the US in size and importance. Its growth in trading 
volume and capitalization has been large. The markets in Tokyo, Osaka, Nagoya and five other 
regional exchanges that now trade were closed during World War II and then reopened in May 
1949. Post war construction and aid by the US helped the Japanese economy grow quickly. By 
1960, 9 % of the world’s equity capitalization was Japanese compared to 58 % in the US, 27% in 
Europe and 6% in the rest of the world. By 1980 Japan had increased its share to 15 % mainly at 
the expense of Europe whose share fell to 20 %. The rest of the world doubled its capitalization to 
12 % and the US still had the majority with 53 %. The 1980s were a period of economic excesses in 
the US that led to a weakening of the strong economic base held in 1980. The Reagan economic 
policies led to large deficits in both the overall budget and in trade, as well as large increases in 
military spending and debt payments (see Modigliani (1988) and Hatsopoulos, Krugman and 
Poterba (1989) for analyses). Meanwhile Japan maintained a policy of high investment in plant 


This paper was written in 1991 for the presentation of Ziemba (1991a). Relevant references and data occurring in the 
meantime have been updated to 1995. 
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and equipment and R&D, financed through policies that emphasized and rewarded high savings. 
Company expansion proceeded more through debt and retained earnings than equity. Debt was 
readily available and was at low nominal and real interest rates particularly through banks in the 
same industrial grouping (keiretsu). 

The 1980s were very financially favourable for Japanese firms. The combination ot low interest 
rates, easy access to funds, strong export marketing expertise, emphasis on quality, access to 
foreign markets while maintaining structures and rules making imports to Japan difficult, led to 
an enormous relative wealth transfer from the US to Japan. By December 1988, Japan’s equity 
capitalization was 44%, Europe’s 21 %, the rest of the world’s 6 %, and the US fell to 29 %. After 
the 1990-92 stock market decline, Japan’s share fell to 25% as of September 1992. The US was 
then 40%, Europe 25% and the rest of the world 6%. 

French and Poterba (1991) and Ziemba and Schwartz (1991) have argued that to properly 
measure capitalization one must adjust for company cross holdings. If company A holds much 
of company B’s stock and vice versa, then the true market capitalization of A plus B is less 
than the sum of the individual capitalizations. Japan and many of the European cconomies such 
as Germany and Italy have extensive cross holdings. In 1990 about 71 % of Japan’s equity was 
cross-held and rarely, if ever, traded. Calculations show that the capitalization of Japan in the late 
1980s was overstated by about 25 % (see for example, McDonald, 1989 and Ziemba and Schwartz, 
1991). After such adjustment, the shares in 1988--89 were about 39 % for Japan, 33 % for the US, 
22% for Europe and 6% for the rest of the world. 

Historically, Japancse stock markcts have been much more influenced by foreign markets than 
the reverse as shown by Becker, Finnerty and Gupta (1990), Hamao, Masulis and Ng (1990) 
and Ziemba and Schwartz (1991). The transmission of mean returns and volatility was mostly 
unidirectional until the October 1987 world wide stock market crash. Since the crash, stock 
price movements in Japan have had more impact on those in New York and London. However, 
the reverse effect is much stronger (see for example, Hamao, Masulis and Ng (1991)). It may be 
argued that this integration of capital markets was small because the Japanese markets were 
insulated to a large extent from foreign influence and not deregulated. 

Deregulation of Japan’s financial markets began in earnest in 1987 with the introduction of the 
first equity index futures contract, the Kabusaki 50, which traded in Osaka. Futures on the Nikkei 
Stock Average had begun trading on the Singapore Monetary Exchange (SIMEX) in 1986. Bailey 
(1989) discusses the early history of these two futures contracts. In September 1988, trading began 
in futures contracts on the more popular Nikkei 225 stock average and the Tokyo stock price 
(Topix) index which were traded in Osaka and Tokyo, respectively. These contracts allowed 
foreign investors and institutions to easily hedge positions in Japanese equities and to engage 
more fully in a variety of types of programmed trading including index arbitrage and portfolio 
insurance. During 1988 and 1989, the Japanese equity markets increased dramatically in trading 
volume and market capitalization. This was a period of cheap and easily available money for 
corporate and individual investors and speculators. 

The equity warrant bonds issued in Luxembourg with the warrants trading in London was one 
such example. By adding the warrants which were stripped off and traded separately the bonds 
could offer low coupons. Hedging the proceeds of the bond sales, which were mainly in dollars, 
back into yen with its considerably lower interest rates, provided net costs at borrowing at close to 
zero percent. The equity warrants, when exercised several years later if they were in-the-money, 


Chapter 7: Risk Arbitrage in the Nikkei Put Warrant Market of 1989-1990 183 


Risk arbitrage in Nikkei put warrant market 245 


provided an additional source of funds for the firm to redeem the bonds for a slight dilution. This 
market was in excess of $200 billion. See Mikami (1990), Takahashi (1990) and Kuwahara and 
Marsh (1992, 1994) for analyses of the pricing of these warrants. A large percentage of these 
warrants expired worthless because of the 1990-95 decline in stock prices. 

Over-the-counter long-dated typically three year puts were marketed in 1988 by major non- 
Japanese brokerage houses to corporate clients who wished to hedge against long Japanese equity 
exposure or to speculate that the high priced Japanese stocks would eventually decline sharply. The 
sellers of these puts, which typically had premium value of $100000 plus and were priced to trade 
at volatilities around 16—20 % versus the historical 13 %, were mostly large Japanese corporations. 
The corporations displayed a collective arrogance about the strength of the Japanese stock market 
and economy by generally not hedging. The high price earnings ratios in the 70 plus range and the 
astronomically high land prices typified by facts such as the Imperial Palace in Tokyo being 
“worth” as much as all the land in California led professional and amateur investors and econo- 
mists to believe that these high prices could not be sustained (see Aron (1981, 1989), French and 
Poterba (1991), Friedberg (1990), Ueda (1990), Wood (1990) and Stone and Ziemba (1993) for a 
variety of analyses with this conclusion). The first Japanese put warrants available to individual 
investors on an easily purchasable basis were the three year American-type Nikkei put warrants 
that traded on the Toronto Stock Exchange in February 1989. These puts were not true warrants 
as they were cash settled based on the price of the Nikkei Stock Average and were not exercisable 
into stock. Also their issuers were investment banks not individual firms. These warrants provided 
individual investors with the opportunity to bet against the high stock prices in Japan with a 
minimal investment of capital. The “warrants” were thus long dated put options. US investors 
were not allowed to purchase those warrants for three months and these warrants were not 
widely advertised and known outside Canada. 

These warrants were purchased in such demand that their price in implied volatility was well 
above the historical for the NSA index. Most Canadian investors probably had no idea what the 
fair value was. However, despite paying prices up to four times the Black-Scholes (1973) fair value, 
investors who held these put warrants to expiry made very large profits because of the large decline 
in the Nikkei index. The Toronto warrants were of three types: ordinary puts valued in yen, puts 
where the final exchange rate for yen is fixed in advance and puts where the NSA was evaluated in 
Canadian dollars. 

This latter type allowed investors to profit from declines in the NSA or the Japanese yen or 
both. Although in principle straightforward to professionals as discussed in Sections 3 and 4, 
below, besides being unable to evaluate the fair values of these warrants investors were unable 
to evaluate the relative differences between the various types of warrants. Thus, with complex 
instruments, even the most sophisticated in the market needed time to understand the products, 
price them fairly and invest in them to eliminate mispricings.! The warrants also had different 
credit characteristics and when exercised were evaluated on the next day’s closing price of the 
NSA in Japan if there was trading that day otherwise on the next trading day. Investors exercising 
warrants could also put in the proviso that the warrant not be exercised if the NSA rose 500 or 


' Indeed Reiner’s (1992) article in Risk is summarized as follows: “Investors and dealers are increasingly turning to equity- 
linked forex options — then finding they don‘t know how to valuc them. Eric Reiner explains how to adapt Black-Scholes 
and its variants”. 
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more points on the requested exercise day. In late 1989 the type I and type III Canadian NSA put 
warrants were greatly over-priced in comparison to fair values based on historical volatilities 
including that for the 1987 world wide stock market crash period. There was no way for a small 
investor to hedge these instruments. Large investors or institutions could, or course, hedge in the 
futures markets on the SIMEX or in Japan. Indeed this was the way that the issuers who sold the 
puts and were responsible for their exercise payments hedged their investment. 

Grossman (1988) among others has argued that this is not a fully suitable approach because 
the futures synthetic does not have the same information requirements as the underlying derivative. 
Hence the same types of difficulties associated with the breakdown of portfolio insurance in the 
1987 crash could possibly occur in this market as well. See Rubinstein (1988) for an analysis of 
the effect of portfolio insurance on the crash. A better hedge for investors was thus a negotiated 
over-the-counter put on the NSA that essentially matched the Toronto stock exchange traded 
warrants. 

Such instruments were available in late 1989 from investment firms such as the Salomon 
Brothers and Bankers Trust. The authors and others were aware of the potential of shorting expen- 
sive Canadian NSA puts and hedging with a fair priced puts of similar characteristics and duration 
in another market. To short the Canadian put warrants these warrants needed to be borrowed 
since there was a fixed number of them issued. They also had to be shorted according to the 
uptick rule. This was more difficult than shorting an ordinary exchange traded put or call or an 
index option which is essentially from an infinite supply and does not have these restrictions. 
However, it was possible to short Canadian NSA put warrants in large numbers at the high 
implied volatilities. It was expected that the market price of these puts would drop to their fair 
value once a fairly priced product was easily available. Bankers Trust, the Salomon Brothers and 
the Kingdom of Denmark issued such warrants in January and February 1990 which traded on the 
American Stock Exchange. These warrants were all fixed exchange rate securities (of type I, called 
quantos) except for the Bankers Trust January put which was a type I with a floating exchange 
rate. 

Bernard and Thomas (1989, 1990), AMeck-Graves and Mendenhall (1992) and Aberbanell and 
Bernard (1992) have shown that frequently there are considerable delays in the market price adjust- 
ment lo new earnings announcements. Indeed some of these adjustments take several months to be 
fully reflected in market prices. Jacobs and Levy (1988) found that lagged earnings surprises are a 
declining but significant factor in US security prices for one, two and three months after their 
announcement. The convergence of the NSA puts to efficiency were similar and the process 
took over one month from the time the first NSA put warrant was traded on the American 
Stock Exchange in January 1990. Large profits were made by hedgers, including the authors, 
although they took several risks that are difficult to quantify. Besides the credit and exchange 
rate risks (which could be hedged) there was a risk of forced buy-ins of the shorts at unfavourable 
prices because it was no longer possible to borrow the puts. All three of the authors had forced 
buy-ins for a small amount of their position. There was also a profitable hedge between fixed and 
non-fixed exchange rate puts that was affected by the absolute price of the puts. 

Developing models to account for these imperfections is difficult; see Figlewski (1989) for some 
results obtained by simulation. In all cases for the Nikkei puts and the Nikkei calls which are 
discussed in Sections 4 to 6, fixed exchange rate options, the quantos, traded for prices above 
non-fixed exchange rate options when the theoretical price was less. Investors were willing to 
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pay a premium to eliminate this exchange rate risk even though it could have been hedged much 
cheaper in the foreign exchange futures markets. Also investors paid a premium for low nominal 
priced warrants that is analogous to that of low priced stocks, see for example, Blume and Stam- 
baugh (1983). This led to a hedge that was close to arbitrage where investors could buy the high 
nominal value but low implied volatility Bankers Trust warrants and sell the lower priced 
Kingdom of Denmark and Salomon type A and type B warrants that had higher implied volati- 
lities on the same exchange (the American Stock Exchange). These warrants were mispriced for the 
month of February 1990. Except for slightly different credit risk and strike prices these warrants 
were virtually identical. One of the authors used this risk arbitrage to win the US stock market 
championship organized in Barron’s in the category of risk adjusted returns for accounts over one 
million dollars in 1990. 

The paper is organized as follows. Section 2 contains a brief background to the Japanese stock 
market bracketing the time of this study (mid 1989 to mid 1990). Historical volatility is also 
discussed there. Additional references on the Japanese stock market include Elton and Gruber 
(1989), Amihud and Mendelson (1991), Chan, Hamao and Lakonishok (1991), Ziemba and 
Schwartz (1991), Ziemba, Bailey and Hamao (1991), Ziemba (1989ab, 199lab), and Stone and 
Ziemba (1993). 

Section 3 discusses the various NSA put warrants and call warrants that were trading in 1989- 
90 on the American and Toronto stock exchanges and categorizes them into the three types which, 
using the definitions in Rubinstein (1991), are ordinary, product and option to exchange. See also 
Donnelly (1990), Smith and Dunn (1990), and Tufano (1992) for general discussions of these 
warrants. 

Section 4 discusses the fair numerical valuation or the three types of puts using the Cox, Ross 
and Rubinstein (1979), Boyle (1988) and Boyle, Evnine and Gibbs (1989) two and three dimen- 
sional binomial lattice models. Other authors such as Derman, Karsinski and Wecker (1990), 
Babbel and Eisenberg (1991), Rubinstein (1991), Gruca and Ritchken (1991), Clyman (1991, 
1992), Chen, Sears and Shabrokhi (1992), Reiner (1992), Wei (1992), Kat and Roozen (1994), 
and Dravid, Richardson and Sun (1993), have also discussed the pricing of the US Nikkei puts, 
particularly the type II fixed exchange rate quantos. 

Section 5 provides a theoretical basis for comparing the three types of puts. Their values 
depend upon the NSA volatility as well as possibly the exchange rate volatility between the yen 
and the home currency (US or Canadian) and their interactions. Using typical parameter values it 
is shown that type I puts should be priced higher than type II and in turn more than type III. This 
is in contrast to the actual pricing where the type II fixed exchange rate options traded for more 
than the ordinary type I puts during the study period. However, even though they were both over- 
priced in relation to historical volatility the type I and type II Canadian put warrants were 
correctly relatively priced by the market. 

Section 6 discusses the put warrant risk arbitrage and the convergence to efficiency of the two 
mispriced markets: the Canadian versus the US and the US fixed versus non-fixed exchange rate 
puts. Relative option costs for fixed NSA volatility and exchange rate volatility as well as implied 
volatility comparisons are made. The preference for fixed exchange rate options which applied to 
puts also applied to calls which began trading in April 1990. The mispriced securities we discuss are 
referred to as hedge candidates although in most cases they are close to arbitrage. Classical index 
arbitrage is actively pursued in Japan especially by the foreign firms, see for example Miller (1993). 


186 Calendar Anomalies and Arbitrage 


248 Shaw et al. 


Discussions of the potential profitability of such arbitrage appear in Brenner, Subrahmanyam and 
Uno (1989, 1990), Brooks and Yamada (1990), Lim (1992), Chung, Kang and Rhee (1992). 

Section 7 briefly discusses the relationship between the NSA put warrant prices in North 
America and the next day’s cash market in Japan. With deep in-the-money options, the put 
discount or premium signalled the up or down direction of the NSA on the next day in Tokyo. 
For a small data set the conclusion is that the signal was correct 68% of the time. This is consistent 
with the conclusion that futures hedging of these instruments had a strong effect on the Japanese 
stock market. Gruca and Ritchken (1991) also noted similar behaviour on the opening prices 
in Japan. A discussion of implications of the findings and concluding remarks appears in 
Section 8. 


2. The Nikkei stock average 1949—1995 and its historical volatility 


The NSA is a price weighted average of 225 large capitalized stocks traded on the Tokyo Stock 
Exchange. It is defined as 


where D, = divisor at time ¢. The original divisor was Djg49 = 225 and Dj992, Dec = 9-967. Figure 1 
shows the NSA from July 1983 to June 1995. The NSA was 109.9 when it began trading in May 
1949, It peaked at 38916 at the end of December 1989. There were twenty declines” of ten percent 
or more during 1949 to 1989. The index rose 220.84 times in yen and 553.04 in dollars from 1949 to 
1989. There were nine declines of ten percent or more during 1990-92. The index fell to 16925 at 
the end of 1992 a decline of 56.5% from the December 1989 high. Investors from 1949 still had 
96.21 for each yen invested and 277.53 for each dollar invested. The 1990-92 decline had its 
minimum at 14309, a decline of 63.2% from the December 1989 peak, on 17 August 1992. 
There is a very active index arbitrage market in the NSA which has been studied by Brenner, 
Subrahmanyam and Uno (1989, 1990), Chung, Kang and Rhee (1992), Miller (1993), and 
others. The value of the futures volume on the NSA trading in Singapore, Osaka and Chicago 
is the highest of any equity index in the world. 

The press called the stock market decline during 1990—92 the bursting of a ‘speculative 
bubble’. See also Tachi (1993) for a similar conclusion by the Ministry of Finance. Ueda (1990) 
and French and Poterba (1991) among others pointed to the very high Japanese stock prices with 
price earnings ratios of seventy or higher in 1988-89. Stone and Ziemba (1993) have analysed the 
steep rise in the stock and land markets during the 1980s in the era of cheap and readily available 
money and the subsequent steep decline largely caused by the Bank of Japan’s tight money policy 
of raising interest rates and decreasing the supply of money. They concluded that the decline in the 
stock and ‘essential use’ land markets can be explained as an adjustment to changing fundamen- 
tals. Speculative land such as the membership prices of golf courses and condominiums, on the 


> A decline is defined as the peak to valley when the fall exceeds ten percent and any subsequent rise would invalidate the ten 
percent fall. 
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Fig. 1. The NSA weekly close July 1983—June 1995. 


other hand, appears more likely to have been a speculative bubble. Table 4 (see forward) points to 
the high stock prices relative to past levels at the end of 1989. 

This paper studies the period mid—1989 to mid—1990. During 1989 the historical volatility was 
in the 10% range or slightly below its 1949-1989 average of 13%. Volatility has not been 
constant. Figure 2 shows the monthly averages of daily volatility from May 1949 to April 1989. 
While volatility peaked at 73.5% in October 1987 most of the time the annualized standard 
deviation was less than 20%. Volatility tends to rise in declining markets (Schwert, 1989; 
Turner and Weigel, 1992). The 1990-91 period in Japan had historical and implied volatilities 
in the 30—60 % range for much of this period; see Fig. 3. 

The annualized volatility was computed using 


250 22 E 
a= (BE -77) 


NSA, 
r, = 1001n | 


where 


for the last 20 trading days of the month, and NSA, is the closing value of the NSA on day t 
assuming there are 250 trading days per year. 
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Fig. 2. NSA historical volatility, monthly averages of daily data annualized in percent, May 1949-April 
1989, Source: Jun Uno, Nihon Keizai Shinbun, Inc. 


3. NSA put warrants on the Toronto and American Stock Exchanges 
1989-1990 


The three year American style NSA put warrants are of three basic types. Let NSAọ be the strike 
price and NSA, the expiry price of the Nikkei stock average in yen. Let Ep be the current exchange 
rate and ŒE, be the exchange rate on expiry for Canadian or US dollars into yen. The symbol (X)~ 
means the greater of X or zero. Then in yen we have using Rubinstein’s (1991) classification of 
exotic options, where a, b, and c are constants. 


Value on expiry Type of put Currency risk in U.S./Canadian 
dollars? 
I. a[NSAy — NSA)" ordinary yes 
E, 
I. ba [NSAy — NSA,|* product no 
0 
NSAy NSA,\* : a Bae ds 
Ill. c E; ( E Dae E e) option to exchange yes, in index value and in this difference 
0 -e 


with the strike price converted to the 
home currency 


It is convenient to value the puts in yen as discussed in the next section. In their home currency 
(U.S. or Canadian dollars) using the symbols defined in Table 1, the puts (including the Paine 
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Fig. 3. Historical and implied NSA volatility 1989-1991. (——) Trailing 21-day historical volatility, 
(+) Osaka implied volatility. Implied volatility is the average of closest to money NSA puts and calls 
for the nearest month using the Gensaki as the interest rate. Source: Baring Securities. 


Webber and Salomon calls) are: 


Puts Calls 
a = SA + 
I. oo) BT-I, SEK, BTB, London OTC PWA 
e 
— NSA,\* 
I. p(t) BT-III, BT-IV, TFC, DXA, EXW, SXA, 
es yar A SXO, PXB Sal 
TI. Wee ee BT-II 
( Ey Ł, ) 


Each put warrant payoff function can be written as Lmax (strike-underlying, 0) Currency Units. 
That is L put options on the named underlying denominated in the specified currency unit. The 
actual payoff may be in different currency units converted at exercise or expiry at the exchange rate 
prevailing then. The warrants may be traded in different currency units. Neither of these affect the 
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Table 1. NSA classification of put and call warrants trading in Canada and the US in 1989-90. 


Warrant Payoff 


Canadian puts 


Bankers Trust-I(BT-D Cdn equivalent at rate then prevailing of yen 0.1168 (32174 — NSA)* 

Bankers Trust-H (BT-I) Cdn 0.1031 (270.54-NSA,/E,)* where E, is the number of yen per Canadian 
dollar at exercise 

Bankers Trust-III (BT-II]) Cdn $2.50/7.25 % (37 460.32 — NSA)*/37 460.32 = 0.0009205 (37460 — NSA)* 

Bankers Trust-IV (BT-IV) Cdn $2.50/7.25 % (29 843.34 — NSA)*/29 843.34 = 0.0011555 (29843.34 — NSA)t 


Trilon (TFC) Cdn $2.75/7 % (37 416.32 — NSA)* /37 416.32 = 0.0010487 (37416.32 — NSA)* 
SEK Cdn equivalent at rate then prevailing of yen 0.1168 (35963.74 — NSA)* 

US puts 

Kingdom of Denmark (DXA) USS 0.2 (37516.77 — NSA)*/145.33 = 0.0013762 (37 515.65 — NSA)” 
Salomon—I (SKA) USS 0.2 (36821.14 — NSA)*/145.52 = 0.0013744 (36821.14— NSA)* 

Bankers Trust (BTB) USS equivalent at rate then prevailing of yen 0.5 (37206.42 — NSA)* 
Salomon-II (SXO) USS 0.2 (37 471.99 — NSA)*/144.55 = 0.0013836 (37471.99 — NSA)* 

Paine Webber (PXB) USS 0.2 (29 246.06 — NSA)*/159.80 = 0.001252 (29 246.06 — NSA)* 

Salomon Warrant OTC, yen 1.0 (32806 — NSA)* 

London 


A/S Eskportfinans (EXW) USS 0.2 (29424.58 — NSA)*/158.84 = 0.0012591 (29424.58 — NSA)* 


US calls 
Salomon (SXZ) 1/15 (NSA — 28 442.94)* 1/158.8 = 0.00041982 (NSA — 28 442.94)* 
Paine Webber (PWA) US$ equivalent at rate then prevailing of 1/10 (NSA — 29 249.06)" 


underlying put option, but its pricing changes. Thus the fixed characteristics of each warrant 
namely: the leverage factor (L), the strike price (X), the name of the underlying (Under), and 
the currency unit of the underlying (CU) places each of the warrants in the standard form 
shown in Table 1. 


4. Fair valuation of NSA put and call warrants 


All of the warrants involve the NSA index and are American type and are valued in yen and 
may involve an exchange rate. Boyle’s (1988) generalization of the Cox-Ross-Rubinstein (1979) 
binomial lattice model was used to create 3—dimensional lattices to model the evolution of the 
NSA and the exchange rate and their interaction over time. In terms of calculation steps for n 
time steps, the CRR is of order n? and Boyle’s is of order n°. The value of the option is the expected 
present value of the option payoff in an economy in which the drift of a risky asset is the risk-free 
rate minus its dividend yield. The discount factor used to calculate the present value of the payoff is 
the risk-free rate. For the dividend yield, we use the foreign interest rate. The NSA dividend yield 
during 1989 was about 0.5%. While most of the dividends are paid in March and September, see 
Ziemba and Schwartz (1991), the continuous approximation used is accurate because the yield is so 
small. We ignore the typical one day or longer time lag between giving notice of exercise and 
actually being cashed out and other special provisions of the warrants including the credit risk 
of the issuer. We also ignore the possibility of predicting the mean return and use standard 
analysis; see Grundy (1991) and Lo and Wang (1995) for models based on return predictability. 
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4.1. Type I put warrants 


The exercise value is Yen L Max|(X — NSA,),0]. The currency plays no role. At expiry one 
converts yen immediately into dollars. Hence one has a standard put option which may be 
valued on a CRR lattice in yen. 


4.2. Type IT put warrants 


The exercise value is Yen L Max [(X — NSA,), O|E;. 

If the Cov (NSA, E) = 0 then the put is an American style on the NSA which may be valued on 
a CRR lattice.” Since the puts are American, lattice methods are required for accurate price 
evaluation. The interest rate used is that in the foreign pay currency. The dividend yield is replaced 
by the actual dividend yield on the index plus the interest rate differential. The currency risk may be 
hedged away by paying an extra dividend yield equal to the differential. By hedging into the foreign 
currency, the appropriate discount factor is the foreign risk free rate. 

Type II warrants are generalizations of type I warrants in that a type I warrant is a type II 
warrant for which the payout currency is yen. 


4.3. Type LIT put warrants 


The exercise value is Yen L Max [(XE, — NSA,,0]. These warrants are fundamentally different and 
only the Canadian BT-II is of this type. They have fair values above intrinsic even if the NSA has 
zero volatility and they may be valued as an option to exchange with a minor modification of 
Margrabe’s (1978) formula. Two risky assets a, b with values S,, Sẹ have 


payoff = Max (0, Sp — Sa) 
= S, Max (0,1 — S/S) 
= S, Max (0, S;,/S,— 1) 
Thus an option to exchange may be regarded as a put on the value of S, denominated in units of S$, 
or a call on the value of S, denominated in units of S,. Let o, and g, be the volatility of a and b, 
respectively, and p be the correlation of the log of the price relatives. Margrabe’s formula requires 


the dividend yield to be zero in the Black-Scholes put and call pricing. Using the notation: pricing 
formula (option type)(asset, X, T, ø, Div B, Div A) asset, 


BS put (S,/S;, 1, T, o, 0, 0) S, or equivalently BS call (S,/S,, 1. T, o, 0, 0) S,, where 
a = V (0} — 2pa ga + 07) 


? Data shows that Cov (NSA, $Cdn/¥) = 0 and Cov (NSA, $US/¥) = 0. If the Cov (NSA, E) # 0, then one may value 
these puts on a CRR lattice by adjusting owsa to oyga +P Onsaoe With the dividend yield equal to dys4 + rusa/can— 
"Japan + P ansa Te: Sec c.g. Reiner (1992) for proof. Thus a 3-dimensional lattice is not needed. 

+A closed form solution exists for this product option in the European case assuming log normal NSA and log normal 
currency changes since the product of lognormals is lognormal; see e.g. Merton (1973). This is developed specifically in 
Gruca and Ritchken (1991) and Clyman (1991, 1992). The latter author also develops arbitrage relationships updating the 
Merton (1973) analysis to this case. See also Reiner (1992), Dravid, Richardson and Sun (1993), and Kat and Roozen 
(1994). 


Table 2. Prices and implied volatilities of actively traded NSA puts and calls on the Toronto, American and London over-the-counter 
stock exchanges, 23 July 1990. 


18 % 


18% 


Ask Country Implied Vol. Relative Options Intrinsic 

Warrant Type price of issue Leverage Strike Expiry Currency volatility price cost delta value 

BT-I Put 2.530 CAD 0.1168 32174 17 Feb 92 JPY 23.9% $1.76 44% —0.40 $0.25 
BT-II Put 2.530 CAD 0.0011552 29843 15 Jun 92 CAD 26.4% $1.27 99% —0.24 $0.00 
BY-Il Put 6.500 CAD 0.0009203 37460 16 Mar93 CAD 29.3 % $5.15 26% —0.90 $5.12 
TFC Put 8.000 CAD 0.0010487 37460 22 Feb 93 CAD 32.7% $5.87 36% —0.90 $5.84 
SEK Put 4.000 CAD 0.1168 35964 16 Nov 92 JPY 18.3% $3.97 1% —0.71 $3.71 
DXA Put 10.000 US 0.0013762 37516 3 Jan 93 USD 29.9 % $7.78 29% —0.90 $7.74 
SXA Put 9.000 US 0.0013744 36822 19 Jan 93 USD 28.2% $6.94 30% -0.80 $6.77 
SXO Put 9.875 US 0.0013836 37472 16 Feb 93 USD 29.0 % $7.78 27% —0.88 $7.72 
BTB Put 20.500 US 0.5 37206 16 Jan 93 JPY 23.8% $18.16 13% —0.85 $17.93 
PXA Put 3.125 US 0.0012516 29249 8 Apr 93 USD 26.6 % $1.45 16% —0.22 $0.00 
PWA Call 5.500 US 0.1 29249 8 Apr 93 JPY 18.4% $5.47 1% 0.84 $1.79 
SXZ Call 3.500 US 0.0004198 28443 6 Apr 93 USD 15.4% $3.61 3% 0.86 $1.45 
Sal OTC Put 28.50 US l 35750 21 Feb 92 JPY 19.2% $27.62 3% —0.74 $26.03 
Sal OTC Put 2.000 US 1 31033 7 Aug 90 JPY 26.1% $0.90 122% —0.20 $0.00 
Sal OTC Put 9.25 US l 28 139 19 Jan 91 JPY 29.9 % $2.75 236% —0.14 $0.00 
Sal OTC Put 20.25 US 1 32 806 24 Apr 92 JPY 22.8 % $15.42 31% —0.44 $6.15 
Sal OTC Put 35.74 DM* 1 36969 26 Jun 91 JPY 22.7% $34.26 4% -0.99 $34.26 
Sal OTC Call 38.50 US l 28139 19 Jun 91 JPY 18.0 % $38.49 0% 0.87 $25.35 
SalOTC Call 21.50 US 0.5 29278 7 Apr 92 JPY 17.7% $21.60 0% 0.82 $8.83 


*The 36969 Sal OTC put traded in Deutchmarks but is valued in US dollars. 
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In a risk neutral economy the drift of S, and S, is r, the risk free rate, thus S,/S, has drift of zero. 
Margrabe’s formula can be extended to dividend paying assets with payoff ¥(S, — S4), where X is 
a constant as follows. Since S,/S, drifts at rate Div b — Div a where Div is dividend yield. The discount 
rate is Div b since the value is measured in units of S,. The value of an American style option to 
exchange S, for S, is then S,CRR (opt type, S,/S,, X, T, s, Div B, Div A). Hence the pricing is 


L EX,CCR (Put, NSA,/E,,X,T, 0,ig, Div NSA). 


The various trading prices, currency of issue, strike prices, leverage values, expiry dates, 
implied volatility, 18% volatility price, relative cost (% above or below 18%, volatility price), 
18% delta and intrinsic values for the various NSA put warrants are illustrated in Table 2 
for 23 July 1990. The basic data is NSA = 31895, Cdn$ = ¥ 148.13, with interest rates of 
11.75% Canadian, 7.361 % US and 7.319 % Japanese. 


5. Numerical comparison of warrant types I, II and IIT 


The three types of warrant puts may be compared as follows. Assume that the American puts have 
a two year exercise period, the home currency is normalized at 1, the NSA is 100, the Japanese 
interest rate is 6 %, the foreign (Canadian or US) interest rate is 10%, the NSA has a continuous 
yearly dividend of 0.5 % and the standard deviation of the NSA is 20 %. The relative values of the 
three types of puts vary with different assumptions on the volatility of the exchange rate and the 
covariance between the NSA and the exchange rate. Assume that the volatility of the exchange 
rate is 5, 10, or 20 % and that the covariance of the NSA and the exchange rate is —0.5, 0.0 or 0.5. 
Table 3 contains fair values for these warrants in terms of percentage of the NSA. 


Table 3°. Comparison of fair values of NSA put warrants. 


a. Type I Volatility of the exchange rate 
% 10% 20% 
—0.5 7.44 7.44 7.44 
Cov (NSA, E) 0.0 7.44 7.44 7.44 
0.5 7.44 7.44 7.44 

b. Type Il Volatility of the exchange rate 
5% 10% 20 % 
—0.5 7.36 7.57 8.00 
Cov (NSA, E) 0.0 7.13 7.13 7.13 
0.5 6.95 6.77 6.42 

c. Type IM Volatility of the exchange rate 
5% 10 % 20 % 
—0.5 7.03 8.65 12.51 
Cov (NSA, E) 0.0 6.02 6.78 9.51 
0.5 4.90 4.59 5.75 


$ The accuracy of these values depends upon computer implementation including the number of nodes in the binomial 
lattice. Eric Reiner in a private correspondence in 1993 found the following similar values using a 1024 node lattice with 
continuous rates of 0.06, 0.10, and 0.005 corresponding to annualized rates of 0.0161866, 0.1051709, and 0.005S0125: 


Type I: all 7.432 Type II: 7.091 8.717 12.575 
Type II: 7.367 7.575 8.011 6.061 6.841 9.568 
7.167 7.167 7.167 4.933 4.932 5.789 


6.973 6.786 6.431 
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When priced in yen, type I do not involve exchange rates, hence all warrant values are equal. 
With zero covariance between the NSA and exchange rates, the values of type II warrants are the 
same regardless of the volatility of the exchange rates. This value is less than that for type I 
warrants because of the positive differential between the foreign interest rate and the Japanese 
rate. However if the covariance is non-zero, then the value of type II warrants depends both on 
that covariance and the volatility of the exchange rate. In general, the value of the warrant 
increases with the volatility of the exchange rate and for put warrants, decreases as the correlation 
increases. Positive correlation means that negative returns on the NSA are associated with a 
strengthening of the yen. The investor receives returns if the NSA declines so if this is accompanied 
by a stronger yen, the payoff is less at exercise than otherwise would be received. 

Type IT] warrants have the most interesting behaviour. Their value depends on the volatility of 
the exchange rate even when the correlation is zero. In general, the higher the correlation, the lower 
the value of the warrant. With positive correlations, the value of the warrant for both low and high 
values of exchange rate volatility is higher than that for the intermediate. For typical observed 
parameters — covariance zero, exchange volatility about 10% and foreign interest rate above 
Japan’s - type I warrants are generally worth more than type II warrants which are in turn 
worth more than type III warrants, all other parameters (leverage, strike price, time to expiration, 
etc) being equal. There is a similar relationship between the type I (Paine Webber) and type II 
(Salomon) calls (see forward to Figs 9 and 10). 


6. Constructing the put warrant hedges and the convergence to 
efficiency 


The various exchange traded puts in Canada and the US and the over-the-counter puts traded in 
London had many common and several different characteristics that led to significant price differ- 
ences. Reasons for the price differences from fair values include currency and cross border risks, 
different credit risks, difficulties with borrowing for short sales, price effects due to the differing size 
of the warrants, differing strike values, inability to value the warrants properly, differing exercise 
provisions, market sentiment and volatility differences. The London over-the-counter market was 
active in 1988 and 1989 for large institutional investors. Prices were quoted by the market makers 
based on historical volatility (in the 15 % range) plus a profit margin. Salomon Brothers and to a 
lesser extent Bankers Trust, made the market with large bid ask spreads as shown below. On 24 
November 1989, the NSA was 36484 and three of the Salomon Brothers over-the-counter put 
warrants were priced as follows. 
Price in dollars 


Strike Expiry date Price in yen 24 Nov 1989 23 Oct 1990 
32 806 24 April 1992 ¥ 934-970 $6.50—6.75 $65.00 
31033 7 Aug 1990 503-575 3.25-3.75 Expired 
28,139 19 June 1991 75.5—76.5 0.40-0.45 $39.50 


By 23 October 1990, the puts had increased at least ten times and the third issuc nearly one 
hundred times. 
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The Canadian put warrants BT-I and BT-II were the first opportunity for non-institutional 
Canadian and US investors (three months after issue) to profit from a fall in the Japanese stock 
market. BT-I was issued in February 1989 and BT-II in June 1989. These warrants were very 
popular with investors and traded for very high premiums and implied volatilities. 

There was considerable good reason to believe that the Tokyo market was overvalued. See for 
example French and Poterba (1991) for one analysis based on adjusted price earnings ratios and 
Ziemba and Schwartz (1991) for a synthesis of various studies. One way to evaluate this is via Paul 
Aron’s adjusted price earnings ratios which are comparable to French and Poterba’s adjustments 
although somewhat lower. Aron (1981, 1989) computed these ratios to the end of August 1989. His 
values are shown in Table 4. His adjustments reflect different accounting and business practices, 
cross holding effects and different capitalization rates. Ziemba and Schwartz (1991) updated 
Aron’s adjusted values after August 1989 with assumptions concerning the earnings change of 
the NSA and capitalization rates. The values are shown in Table 4 up to 22 February 1991. The 
31 December 1989 value of 23.9 was the highest adjusted price earnings ratio at any time since 1949 
and pointed to extreme risk in the stock market. 

Despite its decline during 1990, it was not until the steep decline on 1 October that these values 
became cheap relative to historical price earnings ratios. Other stock market valuation models such 
as bond and stock yield differences, see Ziemba and Schwartz (1991), also were at historical high 
values at the end of 1989°. All of these models are driven by two factors: earnings forecasts and 
interest rates. The extreme increase in interest rates in 1989 from a 2.5% discount rate to 5.25% 
and later to 6.0% was at the heart of the estimated overvaluation. 

In a multivariate factor model regression study for the period 1979-1989, Ziemba (1989) found 
that future earnings forecasts were by far the most important variable for predicting the rates of 
return of Japanese stocks. 

Hence there was considerable reason for investors to believe that the Japanese market would 
crash or at least decline sharply. Since the Canadian puts were the only product available to invest 
in this belief, their prices were understandably very high, particularly given that it was difficult for 
nearly all of the purchasers of these puts to fairly value them. Seasonality observers also noted that 
the decline in January 1990, while only 4.5 %, was a key negative signal since January has histori- 
cally provided the highest returns in the Japanese markets, see Ziemba (1991b). Moreover, the 
conditional probability of a decline in the rest of the year following a decline in January is quite 
high; see Hensel and Ziemba (1995). 

Table 3 shows that the fair values of type I warrants generally exceed those of type IT which in 
turn exceed those of type III. The BT-I is a type I and the BT-II a type III. In terms of premium, 
see Table 5, the BT-I was priced higher than the BT-II. This was the case in the entire trading 
period from September 1989 to the eventual collapse in February 1990. Table 5 provides insight 
into pricing differences for Canadian—US hedges (A) and US hedges (B), but those provided from 
theoretical option pricing models, as we now discuss, were used in our analysis. For example, 
shorting of BT-I or BT-II and buying BT—US looks like a potentially profitable cross border 


é Whenever the difference was more than two standard deviations above the mean, the market was said to be in the “danger 
zone”. This model was applied throughout the 1949-1989 period. During this time there were twenty declines of 10% or 
more. Every time the markct was in the danger zone during this 40 year period, a decline of 10% or more occurred. There 
were declines of 10% or more without the difference being the “danger zone”. 


196 Calendar Anomalies and Arbitrage 


258 Shaw et al. 


Table 4. Paul Aron’s adjusted PERs for Japan compared with those in the US, 26 April 1981 to 31 
August 1989, with adjustments for later periods to 22 February 1991 by Ziemba and Schwartz (1991). 


Date US PER Japan, adj PER NSA 
26 Apr 1981 7548 
19 Oct 1984 10929 
17 Apr 1986 15827 
26 May 1987 24533 
11 Sept 1987 24829 
31 Dec 1987 21 533 
31 May 1988 26963 
30 Aug 1988 27679 
31 Aug 1989 34808 
31 Dec 1989 38915 
30 Mar 1990 29980 
22 June 1990 31 694 expensive | 
30 Sept 1990 20983 aera 
1 Oct 1990 20022 
2 Oct 1990 22896 
31 Dec 1990 23849 
22 Feb 1991 25903 
Values after August 1989 assume: 

Interest 

Earnings gain 

Date US Japan over Aug 89 
Dec 1989 8.2 6.4 5% 
Mar 1990 8.4 7.4 10% 
June 1990 8.2 7.0 12% 
Sept 1990 8.2 7.75 8% 
1 Oct 1990 8.1 7.5 8% 
2 Oct 1990 8.1 7.5 8% 
31 Dee 1990 7.6 6.5 8% 
22 Feb 1991 7.6 6.0 8% 


hedge trade. Similarly shorting Kingdom of Denmark or Salomon 1 and buying BT-I looks like 
a potentially profitable hedge trade on the American stock exchange. Figures 4 and 5 show the 
theoretical pricing in two ways. Implied volatilities appear in Fig. 4. They illustrate the point. 
However, implied volatilities did not exist at many dates in 1990 when the puts were trading at 
discounts (as discussed later in the paper); see the vertical lines in Fig. 4. Hence, a preferable way to 
compare the warrants prices is by their relative cost. That is actual cost minus theoretical value as a 
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Table 5. Comparison of prices and premium values for four Canadian and three US NSA put warrants on 1 
February 1990. 


% of Hedging actions 
NSA Expiry Years to Premium Premium — 
Warrant Price unit date expiry % per year % A B 
BT-1 $2.70cdn 11.68 % 2/17/92 2.05 20.1 9.8 SELL 
BT-II $1.93cdn 10.31 % 6/15/92 PAA 16.4 6.9 SELL 
BT-HI $2.50cdn 14.29 % 2/16/93 3.05 7.0 2.3 
Trilon Finl $2.75cdn 13.7% 2/22/93 3.05 7.25 2.4 
K of Denmark $5.63us 20% 1/3/93 2.93 10.1 3.4 SELL 
Salomon-I $4.63us 20% 1/19/93 2.97 10.1 3.4 SELL 
BT-US $9.17us 50% 1/16/93 3.00 8.0 2.6 BUY BUY 


percentage of theoretical value. This is shown in Fig. 5 assuming a volatility for the NSA of 20 % 
and an exchange rate volatility of 10%. 

There were no NSA put warrants trading in the United States until the Kingdom of Denmark 
(type II) put warrant began trading on the American Stock Exchange on 3 January 1990. The 
Salomon A (type II) and Bankers Trust (type I) put warrants began trading two weeks later. 
With the availability of these three warrants investors in the Canadian put warrants could 
replace these warrants with the much cheaper US instruments. Figures 4 and 5 show that it 
took more than a month for the Canadian puts to converge to efficiency. A gradual decline 
began with the introduction and market knowledge of the three cheaper US instruments and 
then there was a sudden collapse in late February 1990 just after the second Salomon put 
warrant (a type II) began trading. The slowness of the market to react to new information is analo- 
gous to that of stock prices which are frequently slow to react to new earnings information, see 
Affleck-Graves and Mendenhall (1992), Bernard and Thomas (1990, 1992) and Aberbanell and 
Bernard (1992). The market needed time to understand, evaluate and then fairly price these 
complex instruments. 

The collapse occurred at a time of minor decline in the NSA in February 1990 well before the 
steep declines in March and April 1990. Hedge investors who were able to short the Canadian put 
warrants and buy cheaper US warrants particularly the Bankers Trust BTB could have made 
considerable profits’. 

A second advantageous hedge is illustrated in Figure 6. Despite the fact that the theoretical fair 
values of type I warrants was larger than type II, US investors had a preference for type IT instru- 
ments. Apparently they preferred a fixed exchange rate in dollars upon expiry rather than to value 
the puts in yen. To eliminate the currency risk, they paid more for type II warrants than if they had 
bought type I warrants and hedged the currency risk in the futures market. Hence type I] warrants 
traded for prices which were much larger than those of the type I warrants. There was also a price 


7 This hedge had relatively low risk but was not a truc arbitrage given that there were different credit risks and other 
characteristics of the various put warrants. There was also the difficulty of securing and holding borrowed warrant short 
positions. The threat of forced buy-in was also present. All three authors did have forced buy-ins of short positions, but the 
amount was small so that the overall hedge was very successful. 
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Fig. 4. Implied volatility of BT-I, BT-II, and BTB NSA put warrants assuming exchange rate volatility of 


10 %, 17 February 1989 to 21 September 1990. (A) BT-I, type 1, Canadian, (+) BT-II, type 3, Canadian, and 
(©) BTB, type 1, US. 


effect. The BTB warrant represented 0.5 of an NSA unit and the DXA, SXA and SXOs were worth 
only 0.2 of an NSA. Hence the BTB should trade, other things being equal, at about 2.5 times plus 
or minus a transactions cost band around the other warrants. In fact the BTB usually traded at 
prices much lower. This is analogous to the low priced stock effect that captures much of the 
January small firm effect, see Blume and Stambaugh (1983). These two factors yielded the profit- 
able hedge from January to March 1990. After convergence to efficiency these markets have since 
generally traded within transactions costs bands. 

Figure 7 shows the relative costs of various put warrants in Canada and the US with similar 
strike prices. These warrants were all issued in early 1990 and had NSA strikes between 36 822 and 


8 Additional analysis of the post March 1990 period for various US NSA put warrants , particularly of type IT, appears in 
Clyman (1991, 1992), Chen, Sears and Shahrokhi (1992) and Dravid, Richardson and Sun (1993) among others. Generally 
speaking, after transactions costs are considered, the market was efficient. 
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Rel Cost @ Nik Vol 20%, Exch Vol 10% 
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Fig. 5. Relative costs of BT I, BT II] and BTB NSA put warrants with NSA volatility of 20 % and exchange 
rate volatility of 10%, 17 February 1989 to 21 September 1990. Relative deviation from model price = 
(actual cost — theoretical value)/(theorctical value). (+) BT-I, type 1, Canadian, (©) BT-II, type 3, 
Canadian and (A) BTB, type 1, US. 


37472. Here are shown the higher prices paid for type II warrants in comparison to type I from 
January to the end of February 1990. 

Figure 8 shows the relative costs of Canadian type I, If and HI NSA put warrants. Investors, 
relative to fair prices paid more for type I (BT-I) than for type III (BT-II) until the market 
converged to efficiency in late February 1990. From March to September 1990, ail three types 
of put warrants had relative costs within transaction cost bands. Figures 9 and 10 give the 
implied volatilities and relative costs of the two NSA call warrants traded on the American 
Stock Exchange. The Paine Webber call is a type I and the Salomon is a type H. The fair value 
of a type I should be higher than a type II. However, US investors preferred the type LI with its 
fixed exchange rate of dollars into yen and bid its price higher during most periods from April to 
October 1990. 
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Fig. 6. Relative cost of US type I (BTB) versus US type I] (DXA, SXA, SXO) NSA put warrants, January to 
September 1990, assuming NSA volatility of 20 %. (C1) BTB, type 1, 0.5 NSA, (+) avg DXA, SXA, SXO, type 
2, 0.2 NSA, and (—) normalized Nikkei. 


Table 6. Percent of days the intrinsic value exceeded the market value (Source: Clyman, 1991). 


Month DXA SXA SXO PXB EXW 


Jan 0.0 0.0 na na na 

Feb 0.0 0.0 0.0 na na 

Mar 18.2 22.7 27.3 na na 

Apr 65.0) 70.0 $5.0 0.0 0.0 
May 40.9 40.9 40.9 0.0 0.0 
June 0.0 0.0 0.0 0.0 0.0 
July 0.0 0.0 0.0 0.0 0.0 
Aug 43.5 43.5 43.5 0.0 0.0 
Sep 57.9 52.6 52.6 5.3 0.0 
Oct 52.2 47.8 52.2 4.3 4.3 
Average 29.1 29.5 35.8 1.4 0.8 


Exercise Price 37517 36821 37472 29249 28425 
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Fig. 7. Relative costs puts with similar strike prices, assuming 20% NSA volatility, Yen/Canadian dollars 
volatility of 10%, January to September 1990. (C) BT III, (+) TFC, (©) DXA, (A) SXA, (x) SXO, and 
(V) BTB. 


7. The relationship between NSA put warrant prices in North America 
and the cash market in Tokyo 


During much of 1990 the Toronto and New York NSA put warrants were trading deep in the 
money. Frequently the puts traded for less than their intrinsic value, see Table 6. Tokyo's next 
trading session was the following day. Since much futures hedging was required to protect the 
issuers’ positions, and that trading would lead to index arbitrage if the futures prices deviated 
much from fair value, the prices in North America provided a forecast of the likely prices in 
Tokyo. If the put was trading at a discount one would expect the Tokyo market to rise. Similarly, 
the forecast was for a fall in the Tokyo market if the put was trading at a premium. An indication 
of the size of the market is that during 1990 the NSA puts averaged 13 % of the trading volume 
(and a similar fraction of the trading value) on the American Stock Exchange. Informal estimates 
by the authors of the size of the market in the US and Canada suggests that it was possible that 
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Fig. 8. Relative costs of Canadian type I, II and III put warrants based on 20 % NSA volatility and 10 % Yen/ 
Canadian dollar volatility, January to September 1990. (+) type II, avg BT. UI, TFC, SEK no Volume, 
(©) type I, BT-I, (A) type III, BT-II and (V) normalized Nikkei. 


upwards of 20% of the NSA futures trades in Osaka and Singapore were related to NSA put 
hedging. Consider for example, the SXA Salomon January 1993 NSA put. It had a strike price 
of 36821.14, a currency conversion rate of 145.52 yen per US dollar, and is worth 0.2 of an NSA 
unit. The intrinsic value of the put was 


0.2 (36821.14 — NSA) 
145.52 $ 
Hence the implied NSA is 36821.14 — 5 (145.52)P. Table 7 shows that on the seventeen of the 
twenty-five trading days from 1 August to 6 September 1990, the forecast was correct (six of the 
incorrect predictions were expected rises). 
There are many control aspects to a full study of this relationship such as open versus closed 
prices, futures effects, are futures and warrants giving the same estimate of cash prices, etc? 


? For some progress on this see Dravid, Richardson and Craig (1993) and Yuen (1994). 
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Fig. 9. Implied volatility of the Paine Webber and Salomon NSA call warrants, April to October 1990. (D) 
PXA, and (+) SXZ. 


However, there seems to have been a strong relationship between the price of the NSA put 
warrants in North America and cash prices in Tokyo on the next trading day. 


8. Implications of the findings and concluding remarks 


The paper has described two favourable hedges involving Nikkei put warrants during the period 
November 1989 to February 1990. The cross border hedge involved shorting overpriced Canadian 
Nikkei put warrants which traded on the Toronto Stock Exchange and purchasing either Nikkei 
puts with negotiated terms over the counter in London or exchange traded puts on the American 
Stock Exchange. Since the Canadian puts were unavailable to US investors for three months from 
their issue in February and April 1989 they were not heavily advertised or known in the United 
States. US residents and citizens could have traded them at the time of the hedge, however. The 
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Fig. 10. Relative costs of the Paine Webber and Salomon NSA call warrants using 20% NSA historical 
volatility, April to October 1990. (0) PXA, (+) SXZ, and (—) normalized Nikkei. 


reasons for the mispricing are several. The puts were complex for most ordinary investors and ail 
but experienced option traders in Canada likely evaluated them incorrectly. Evidence of this is 
found in the literature on them from various Canadian brokerage houses. Many investors in 
Canada and academics — see for example, French and Poterba (1991) and Ueda (1990) — were 
quite convinced that the Japanese market was overpriced. Even the Canadian investors bidding 
up of the price did not prevent them from making considerable profits later when the Nikkei fell 
sharply". The Canadian puts finally declined into their theoretically correct pricing about a month 
after the US puts were trading on the American Stock Exchange. 

The studies of Bernard and Thomas (1989, 1990), Affleck-Graves and Mendenhall (1992) and 
Aberbanell and Bernard (1992) show the slowness of individual stocks to react to new earnings 
information. Hence, it is not surprising that this convergence to efficiency of more complex cross 
border investments would take about a month to occur. 


1 According to Slocum (1993) investors in the four Bankers Trust warrants made a total profit of about Cdn$500 million. 
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Table 7. The Salomon Nikkei January 1993 put warrants record at predicting the following 
day’s change in the NSA in Tokyo, | August to 6 September 1990. 


Date SXA Implied Nikkei Nikkei close Prediction* 
8/01 9.375 30 000 30838 fall 

8/02 10.50 29181 30245 fall 

8/03 10.875 28 908 29516 fall 

8/06 12.625 27635 28 600 fall 

8/07 12.00 28 090 27 653 rise 

8/08 11.125 28 726 28 509 rise x 
8/09 11.75 28272 27616 rise X 
8/10 12.875 27453 27 330 rise X 
8/13 13.75 26817 26176 rise 

8/14 12.875 27 453 26673 rise 

8/15 12.375 27817 28112 fall 

8/16 13.875 26726 27 549 fall 

8/17 14.125 26 544 26 787 fall 

8/20 13.50 26 998 26 490 rise x 
8/21 14.75 26089 26 298 fall 

8/22 16.00 25179 25211 fall 

8/23 18.125 23 633 23 738 fall X 
8/24 16.625 24725 24166 rise 

8/27 14.625 26 180 25142 rise 

8/28 15.50 25 543 25711 fall 

8/29 16.00 25179 24 895 rise 

8/30 16.00 25179 25 670 fall X 
8/31 15.625 25452 25978 fall 

9/03 closed na 25420 

9/04 16.25 24997 24908 risc x 
9/05 17.00 24 442 24078 rise x 
9/06 17.75 23 906 23812 


* The prediction is for a fall (risc) in the Nikkei on day ¢ + 1 if the implied Nikkei on day ¢ is below (above) 
the close on ¢. 
Source: Modified from The Wali Street Journal (1990). 


Interestingly, Bankers Trust also issued Canadian dollar against the US dollar put warrants in 
June 1990. These traded on the Toronto Stock Exchange at a time when many Canadians expected 
a sharp decline in the Canadian dollar while US exchange traded options on the Canadian dollar 
were actively traded. These puts were also overpriced and they stayed overpriced for the entire year 
until they and the US puts expired worthless in June 1991. This latter case has some parallels with 
the Nikkei put hedge but important differences. 

With the Canadian dollar puts the difference in price could be explained by the fact that it was 
extremely difficult to short these puts!!. For those that did, including two of the authors, there were 
considerable profits in a percentage but not absolute basis. In contrast it was not difficult to short 
the Canadian Nikkei puts in large numbers. Salomon Brothers and other issuers were in the posi- 
tion to participate in the mispricing hedge. Presumably for business reasons concerned with selling 
such products, they did not converge the mispricing to efficiency sooner than February 1990. The 
market did price the relative values of the type IT and type ITI Canadian puts correctly. Other 


1 Another example in Holland, with similar mispricings related to the inability to short the overpriced warrants, is discussed 
by Veld and Verboven (1992). 
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reasons for the temporary mispricing of the Canadian puts as discussed in the text are the risks of 
buy-ins, cross border risks, small relative currency risks and differing credit risks. The latter is 
mitigated somewhat in the hedge: sell Canadian Bankers Trust puts and buy US Bankers Trust 
puts. Still if an extraordinary event occurred and there was no trading in the Nikkei index the 
liquidity of the two types of puts could have been different. 

The second hedge involved securities of fixed versus floating exchange rate on the American 
Stock Exchange. The explanation for this mispricing which also lasted about one month seems to 
be the price effect and the different ways one can view the currency risk and pricing. The price effect 
where the Bankers Trust US warrants had NSA sizes two and half times as large as the Kingdom of 
Denmark and the Salomon puts is totally analogous to the effect of low priced stocks in January 
studied by among others Blume and Stambaugh (1983). It is known that much of the January small 
firm effect can be equally viewed as a low price effect. Hence, it is not surprising that in the very 
beginning of their trading the much higher nominally priced BT warrants traded for somewhat 
lower actual prices. Another possible reason for the discrepancy involves the currency risk. The 
theoretical models assume that currency prices are based on their forward rates. Hence, if investors 
were assuming that the lower yielding yen would not appreciate against the higher yielding US or 
Canadian dollars as evidence summarized by Froot and Thaler (1990) suggests for such currencies, 
then higher prices were warranted for the fixed exchange rate puts'”. Since even this explanation, 
that is, assuming that the forward rate equals the spot is not enough to explain the full extent of the 
mispricings it appears that a combination of the two effects and the premium that is warranted for 
eliminating the currency risk is the logical explanation. 
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Chapter 8 


DESIGN OF ANOMALIES FUNDS: 
CONCEPTS AND EXPERIENCE 


Dennis R. Capozza and William Ziemba 


INTRODUCTION 


During recent years, many researchers and practitioners have hunted for 
deviations from market efficiency in a quest for excess returns. The big game 
in this hunt is a bona fide “anomaly.” The hunters have flushed out some 
veritable “beasts” that are disconcerting to the more doctrinaire efficient market 
advocates but heartening to the antagonists. However, if anomalies are to be 
more than elusive and ephemeral prey, a strategy must be developed that 
realizes an excess return. Implementation of such an anomalies strategy is the 
subject of this paper. 

In this context, market efficiency is usually defined in terms of the capital 
asset pricing model (CAPM). The simple version of the CAPM can be written 


(rs) = rp + Bl Elm) — ry) 
where E(r,) is the expected return on a risky asset 
ry is the yield on a riskless asset 
Erm) is the expected return on the market 


Bs ìs the relative risk measure for the risky asset. 


As suggested by many writers, if the model is correct and security markets 
are efficient, security returns will, on average, conform to the above relation. 
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Persistent departures from the CAPM violate the joint hypothesis that both 
the CAPM and the efficient market hypothesis are correct. “Anomalies,” or 
security market regularities are departures from the joint hypothesis. While 
CAPM is the dominant asset pricing model against which efficiency has been 
tested and anomalies observed, these anomalies appear to be pervasive even 
when other valuation theories such as APT are used (see Giiltekin and Giiltekin 
1987; Cho and Taylor 1987). 

In the next section we describe ways to structure an anomalies fund. In the 
third section we survey some exploitable anomalies. The fourth section outlines 
the experience of two anomalies funds. The fifth discusses problems of 
implementation and the final section concludes. 


WHAT IS AN ANOMALIES FUND? 


If a true anomaly exists it should be possible to construct a portfolio of 
securities that will earn an excess return with little or no risk. For example, 
suppose low P/E stocks outperform high P/E stocks on average. Then a 
portfolio that is long low P/E securities and short high P/E securities in the 
right proportion should earn the excess return with little or no systematic risk. 
In practice it may be difficult to eliminate all risk with offsetting short positions, 
but in theory it is possible to do so. If a portfolio is constructed in this low- 
risk manner we have a “pure” anomalies fund. A pure anomalies fund should 
have an excess return with little or no systematic risk. 

To create a portfolio with a higher expected return, one need only reduce 
the short positions in the portfolio. This portfolio with only long positions 
could be constructed with an overweighting of “anomalous” securities. The 
portfolio will be risky; but if the stocks are bona fide anomalies, the returns 
on this portfolio should exceed those on the market over a sufficiently long 
horizon. The portfolio will have risk characteristics similar to the market 
portfolio unless the anomaly stocks (e.g., low P/E) are exceptionally risky.' 
If portfolios are constructed in these risky ways, the portfolio is an anomalies 
“growth” fund. 

Most fund managers are, in effect, following a strategy similar to this latter 
concept. In this paper, however, we reserve the term “anomalies fund” for 
portfolio strategies based on the anomalies which have been documented in 
the academic literature to have statistically significant excess returns. 


EXPLOITABLE ANOMALIES 


While a wide variety of phenomena has been proposed as possible exploitable 
anomalies. the best documented anomalies include those associated with 
seasonality (January, monthly, weekly, and holiday). insider trading, 
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unexpected earnings, and closed-end mutual funds. Other anomalous behavior. 
such as during tender offer repurchase of shares (Lakonishok and Vermaelen 
1988), are not explored here. Surveys of equity return anomalies are available 
in Jacobs and Levy (1988) and Keim (1986). 


Seasonality -January 


The abnormal return at the beginning of the year, the “January effect,” has 
reccived a great deal of attention in the academic literature. Because of its size, 
persistence, and pervasiveness, the January effect is the “beast” among the 
anomalies. For small firms 40% of the return for the year occurs in January 
(Rogalski and Tinic 1986). The same seasonal pattern appears in stock markets 
around the world (Haugen and Lakonishok 1988; Gültekin and Gültekin 1983). 

In attempting to explain the January effect, researchers have found it to be 
more pronounced for small firms (Keim 1983a:; Reinganum 1983; Roll 1983; 
Ritter 1988), for low P/E stocks (Jaffe, Keim, and Westerfield 1989), and for 
stocks that have declined sharply (DeBondt and Thaler 1985, 1987). When 
combined with these other anomalies, the January seasonal is large enough 
to support active portfolio strategies. That is, since the excess return exceeds 
typical transactions costs, an investor should be able to buy stocks in December 
for liquidation at the end of January profitably. Lakonishok and Smidt (1984), 
however, caution that the small-firm effect in January arises from errors in 
variables due to non-trading of small-firm stocks at the turn of the year. The 
use of the bid-ask spread mean for price by CRSP (Center for Research in 
Security Prices) and the regulation of market-makers may lead to biased results 
in empirical tests. However, even these authors do find a significant abnormal 
return for small firms, before brokerage, at the end of the calendar year. 

This active strategy contrasts with the “virtual” strategy that must be used 
with many of the smaller anomalies. When transactions costs exceed the 
anomaly'’s excess return, for example, with the “Monday effect” discussed 
below, an investor cannot profit from an active strategy of buying and selling. 
Instead, the anomaly can help to displace trades in time to favorable moments. 
For the Monday effect the investor would delay purchases until late Monday 
and advance sales to the preceding Friday close. 

Many active strategies can be devised to exploit the January effect. These 
include buying stock of small firms with low P/E ratios in December, buying 
firms with the largest price declines, and buying Value Line futures while 
shorting the S&P futures (Clark and Ziemba 1987). This last strategy would 
be suitable for a pure anomalies fund while the others are suitable for an 
anomalies growth fund. A pure anomalies fund could also short large firms 
with high P/E ratios, index futures, or options. This will eliminate the risk 
in the long positions of the small firm, low P/E strategy. 
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Seasonality-Monthly, Weekly, and Holiday Seasonals 


Other well-documented seasonal anomalies include the monthly, weekly, and 
holiday effects. Ariel (1985) found that returns in the first half of the month 
are significantly larger than returns in the second half (except in February). 
The first nine trading days have an average return of 1.4%, while the last nine 
average —.02%. 

French (1980), Gibbons and Hess (1981) and Keim and Stambaugh (1984) 
find a small but significant negative average return on Monday of about —.14%. 
Ariel (1987) reports a seasonal around legal holidays. 

None of these is large enough for the average investor to exploit via an active 
strategy, although the monthly seasonal may be exploitable for low-cost 
traders. Therefore the virtual strategy outlined above must be employed. Stock 
purchases can be timed to coincide with the beginning of the favorable period 
and stock sales can be timed for the end of favorable periods. 


Insider Trading 


Insider buying and selling signal the future price movements of a stock for 
up to six months following the insider trades (Seyhun 1986, 1988). This 
anomaly is small but can be combined with other anomalies such as the small- 
firm, low P/E, and overreaction effects of January. A pure anomalies fund 
would buy firms with net insider buying and sell short firms with net insider 
selling. An anomalies growth fund would simply overweight the portfolio with 
insider buying securities and underweight with insider selling securitics. 


Closed end Funds 


Thompson (1978) has found that buying closed-end funds that trade at a 
large discount to net asset value (e.g., greater than 20%) provides excess returns. 
This anomaly is both small and risky but can be combined with other anomalies 
to yield a tradeabie strategy. A pure anomaly fund would buy closed-end funds 
at large discounts and short funds at small discounts. 


Unexpected Earnings 


Foster, Olsen, and Shevlin (1984) and Jacobs and Levy (1988) find persistent 
price behavior around the announcement of unexpected earnings. The excess 
returns persist for six months following the announcement. Bernard and 
Thomas (1989) argue that most of the drift following earnings announcements 
takes place around the subsequent quarter’s earnings announcement. That is, 
it appears that the market fails to appreciate fully the implication of one 
quarter’s earnings for the next quarter’s earnings. Strategies similar to those 
outlined above can be used to exploit this anomaly. 

A summary of some of these findings appears in Table 1. 
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Type 
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A Partial List of Findings on Stock Market Anomalies 


Authors 


|. Small firm and P/E effect 


a) Small firm 


b) P/E Effect 


IL Seasonality 


a) Monthly 


b) Weekly 


Ht. Insider 
Trading 


IV. Closed End 
Funds 


V. Unexpected 
Earnings 


Keim (1983) 


Reinganum (!983) 


Roli (1983) 


Rogalski and Tinic 
(1986) 


Ritter (1988) 


Jaffe, Keim, 
Westerfield (1989) 


Ariel (1987) 


French (1980) 


Gibbons & Hess 
(1981) 


Keim & Stambaugh 
(1984) 


Seyhun (1986) 


Thompson (1978) 


Foster, Olsen and 
Shevlin (1984) 


Bernard and 
Thomas (1989) 


Finding 


Annualized abnormal returns difference of 30.3% 
between small and large firms. Difference is 15.4% 
in all months excluding January (1963-1979). 


Negative correction between size and abnormal 


returns,e even after adjusting for tax loss selling 
(1963-1979). 


January returns primarily on last day of December 
and first four trading days of January. 6.89% for 
NYSE, 14.2% for Amex: 5 day return (1963-1980). 


EW market portfolio earns average daily return of 
0.34% in January—which ts at least 4 times larger 
than other months (1963-1982). 


Mean difference between small and large firm 
returns on first 9 days of January is 0.00876% 
(1970-1985). 


Moving from lowest to highest quintile of E/P 
ranked firms increases returns by 3.2% annually. 


Cumulative return of 1.4% on first 9 days of 
month, —0.02% return on last 9 days of month 
(1963-1981) 


Mean Monday returns = —. 166 (1953-1977) 
Mean S&P 500 Monday returns = —.13% (1962- 
1978) 

Mean Monday returns = —.199% (1928-1982) 


1.1% abnormal return following insider trade 100 
days after reporting (1975-1981) 


Average abnormal return in excess of 4% per year 
on trading on closed end funds trading at a dis- 
count (1940-1975). 


Sign and magnitude of earnings forecast error 
explains (81% of) post announcement drift in stock 
returns. 


5% difference between excess returns of high and 
low decile SUE firms over 60 days following unex- 
pected earnings. 
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EXPERIENCE 


The ultimate test of an anomaly is whether strategies based on the anomaly 
yield excess returns. Unlike a statistical study, a portfolio manager must deal 
with commissions, bid/ask spreads, unanticipated events, power and telephone 
outages, human emotions, “fast markets,” and other practical considerations 
that cannot be fully accounted for in a study. In this section we describe the 
results of an attempt to follow an anomalies strategy by the Canadian Asset 
Research Institute (CARI). 

Canadian Asset Research Institute has managed both a pure anomalies fund 
and an anomalies growth fund on an experimental basis since early in 1987. The 
CARI Anomalies Fund attempts to exploit the anomalies described above by 
buying securities with the indicated characteristics and selling index futures or 
options to reduce the systematic risk. During seasonal periods that are particularly 
favorable (e.g., early January) the fund may leave the long position unhedged 
or only partly hedged by short futures and options. 

The success of the fund at converting the anomalies into excess returns at 
low risk can be assessed from Table 2. 

As can be seen in the table the first year of operation, which includes 
the October crash, was not very successful for this fund. The excess return 
is negative and the risk level is moderate with the beta at .65 (65% of the 
risk of the market index). The second year on the other hand, is much more 
successful. The excess return is about 1% per month while the risk is very 
low with beta equal to .15. Unsystematic risk also declines in the second 
year. 

The CARI Growth Fund attempts to exploit the same anomalies as the 
CARI Anomalies Fund but leaves the longs unhedged or less hedged. Figure 
1 below summarizes the experience of this fund relative to the S&P 500. As 
with the Anomalies Fund, the CARI Growth Fund is more successful at 
achieving its aims in 1988 than in 1987. 


Table 2. CARI Anomalies Fund Risk and 
Return (percent per month) 
(rs: = Gs + Bstm + Est) 


1987 1988 
alpha —.311% 1.45% 
beta 0.65 0.15 


unsys. risk 4 1% 3.3% 


617 


na 


Net Asset Value 
Monthly Data 1987/88 
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Figure 1. CARI Growth Fund and S&P 500, 1987-88 
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PROBLEMS OF IMPLEMENTATION 


The most difficult problems associated with the anomalies strategies arise in 
hedging positions to reduce risk. As a result, the pure anomalies concept is 
more difficult to implement than the anomalies growth concept. 

To maintain a proper neutral hedged position, both the risk in the long 
position and the risk in the hedging instrument (stocks, stock index futures, 
or stock index options) must be monitored. Since the risk can change both 
with the portfolio positions and with market fluctuations, there is considerable 
opportunity for both errors and transactions costs to rise. Events like the 
October 1987 meltdown tend to greatly increase total trading costs 
(commissions and execution expenses). For the CARI Funds many problems 
with hedging the positions arose in the first year—especially as a result of the 
October 1987 crash. 

When anomalies are exploited by taking short positions in stocks, it is 
important to be able to earn full interest or invest the proceeds on the shorts. 
The excess returns in most anomalies is small so that the loss of interest can 
eliminate the excess return. Usually it is not possible to obtain full interest 
credit. Consequently, alternative positions in index futures or index options 
where interest is credited tend to be more attractive. 

Many of the anomalies involve positions in small firms. These securities tend 
to be thinly traded with large (in percentage terms) bid-ask spreads (Stoll and 
Whaley 1983). As a result, great care must be taken in the execution of trades. 
Large orders will drive prices up by more than the small excess return predicted 
by the anomaly and eliminate any expected gains. Low priced securities are 
particularly vulnerable since bid/ask spreads on these securities can exceed 10% 
of the market price. 


CONCLUSION 


Recent research has uncovered anomalies that violate the joint hypothesis of 
market efficiency and the CAPM. Experience suggests that the anomalies 
research can help to construct portfolios that earn excess returns. The most 
significant of these anomalies is the January effect which is intertwined with 
other anomalies. Only time will tell whether these anomalies will persist in the 
face of an onslaught of portfolio managers attempting to exploit them. 
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Chapter 9 


Land and Stock Prices in Japan 


Douglas Stone and William T. Ziemba 


trillion. This was more than 20 percent of the world’s wealth, or to put it 

in some other contexts, about double the world’s equity markets or half 
again as large as the world’s bond markets. Japanese land was then valued at 
about five times that of the United States; the land under the Emperor’s Palace, 
which is about three-quarters of a square mile, was estimated to be worth about 
the same as all the land in California or in Canada. Real estate assets of 
Japanese corporations grew by $2.8 trillion from 1986 to 1988, an increase in 


L: late 1991, the total land value in Japan was estimated at nearly $20 


valuation roughly equal to the size of the Japanese gross national product. 

An equally dramatic rise in stock prices accompanied the rise in land 
prices. At its peak in December 1989, the Japanese stock market had a value of 
about $4 trillion, which was about 44 percent of the world’s equity market 
capitalization. To put that figure in perspective, the value of the equity on all 
the stock exchanges in the United States in August 1992 was less than $5 
trillion. But then, from its peak in December 1989 to August 1992, the 
Japanese stock market fell by over 60 percent. Various indices of speculative 
land values fell a similar amount. Meanwhile, other land prices—industrial, 
commercial, residential, as measured by various indices—fell 15-20 percent. 

This paper discusses the rise of Japanese stock and land prices in the past 
four decades and their dramatic decline in the early 1990s. To what extent can 


m Douglas Stone is a member of the Research Department of the Frank Russell 
Company, Tacoma, Washington. William T. Ziemba is the Alumni Professor of Manage- 
ment Science, University of British Columbia, Vancouver, Canada. 
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fundamental factors explain both the price levels and the returns from land 
and stock in Japan? Are land prices driving stock prices, or the other way 
around, or are stil] other factors affecting both? How has government policy 
interacted with the price changes? In practice, it is very difficult to solve the 
problem of separating the explanation that a bubble occurred from the possibil- 
ity that the underlying fundamental problem is misspecified, a difficulty ex- 
plained by Flood and Hodrick (1990). We believe that the bulk of the rise in 
Japanese asset prices from 1985-89 and the decline during 1990-92 was 
driven by interest rate and credit market conditions. However, in certain 
speculative land markets, there is some evidence in support of the bubble 
hypothesis. 


Rational Explanations for Japan’s Land Prices 


The most expensive land in the world is in central Tokyo. In the Tsukamoto 
Sozan Building in Ginza 2-Chome, one square meter was priced at 37.7 million 
yen, or about U.S. $279,000 at the December 1990 exchange rate of about 135 
yen per U.S. dollar. The average lot for a U.S. house, at the 1990 price of 
$9000 per square meter in Japan, would cost about $9 million. Tokyo is 
especially crowded and congested and space is in great demand. For example, 
the fines for parking in a no-parking zone are as high as 200,000 yen (about 
$1667), which is about two weeks pay for an average worker. A cup of coffee at 
a major Tokyo Hotel can cost $4, with the high price based on the product, the 
service, and the high cost of the rent of the land. 

It’s clear that economic fundamentals can at least explain why Japanese 
land prices are higher than elsewhere. Japan’s population, at 120 million, is 
about half that of the United States. However, Japan’s area of 377,000 square 
kilometers is only about 4 percent of the United States, an area about the size of 
Montana. Moreover, the habitable area of Japan is only about 80,000 square 
kilometers, about 1/60 of the U.S. amount, an area about the size of South 
Carolina. Thus, in terms of density, Japan’s population per unit of habitable 
area is about 30 times that of the United States. Japan’s GDP per unit of 
habitable land is approximately 21 times as large. 

Moreover, the use of Japan’s habitable land is severely restricted by various 
regulations and tax laws. For example, the tax structure creates low costs for 
holding land and very high costs of selling land. Inherited land is valued well 
below the market price for tax purposes. Since other financial assets are valued 
at full market price, investors, seeking to lower ordinary and estate taxes, invest 
and hold land often with borrowed money. Existing land is underutilized 
because of regulations on zoning, height restrictions, and so on. Laws dating 
back to World War II, particularly the Building Lease Law, have made it 
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virtually impossible for a landlord to evict tenants even when the lease expires.’ 
As a result, rather than building rental property, a large number of land-owners 
hold vacant land or turn the land into parking lots while waiting for an 
attractive opportunity to sell.” Noguchi (1991) and Mera (1992) argue that it is 
land utilization, not the amount of land in Japan, that is crucial. 

Perhaps the most extreme illustration of these restrictions is Tokyo itself. 
Remember, Tokyo has the world’s highest land prices and it is an outlier in 
virtually all comparisons of value, be they price or rent. However, there is still 
considerable vacant and underutilized land in greater Tokyo. A survey by the 
Ministry of Construction concluded that about 160,000 acres were available for 
housing. This includes 89,000 in farmland, 56,800 in underutilized land (va- 
cant and parking lots) and 14,800 in vacated factory sites, publicly held idle 
land, and land of the now defunct National Railways. In addition, most builders 
do not utilize the legally allowed capacity. According to the Japan National 
Land Agency, as of 1986 the legally allowable floor-to-land-area ratio in Tokyo 
was 242 percent, however, the actual figure was 95 percent, for a usable ratio of 
39.3 percent. 

Boone and Sachs (1989) and particularly Boone (1989) have attempted to 
use facts like these to justify the high land values in Tokyo and the rest of Japan 
based on rational economic models. Their explanations fall into three cate- 
gories. First, Tokyo is an extreme outlier in prices and rents, so the high prices 
might be due to inefficiencies and excess concentration in that area. However, 
even without Tokyo, Japanese land prices greatly exceed those in the United 
States and Europe. Every prefecture in Japan has a measure of land value 
relative to GNP which is greater than any of the major industrialized countries. 

Second, distortions between the urban and rural sector, including tax 
policy and agricultural protection, also affect land prices by disfavoring urban 
land, and thus increasing its scarcity. Agricultural land is taxed less than a tenth 
as heavily as residential, commercial or industrial land. Ando and Auerbach 
(1990) have estimated that taxing agricultural land at the same rate as other 
land would increase the availability of residential land enough to reduce 
housing costs by 28 percent. This estimate seems high, since it seems to assume 
that freed-up rural land can be a perfect substitute for urban land. Boone 
(1989) took this approach further in two ways: he incorporated data on how 
rents decline as one moves away from a city center, and found this would lead 
to a 3 percent drop in aggregate land value. However, he also calculated that if 
agricultural policy were liberalized, a 9-12 percent drop in aggregate land 
values would result. Because these factors explain only a small proportion of 


"Legal changes in 1992 may make it easier to evict tenants. 
? Mera (1992) shows the optimality of this decision from an economic point of view. He also shows 
how this could drastically change with a substantial increase in land taxes. 
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the difference in land prices, Boone suggests that the major explanation for the 
high land values in Japan are macroeconomic factors. 

In a series of models, Boone is able to rationalize the high land prices. The 
long-run determinants of land values (measured relative to GNP) are the share 
of rents in an economy-wide aggregate production function and the capitaliza- 
tion rate for future rents. In turn, rents depend on productivity growth in the 
economy, high utilization of land, and low property taxes (to keep the rate of 
return high). Future rents depend on these factors as well, discounted with the 
appropriate interest rate. Several models, along the lines of Cass (1972) and 
Solow (1973), all predict that economies with some combination of low rates of 
time preference, high productivity growth rates, and low property taxes 
(through their effect on the required return for land) would tend to have high 
measures of land value relative to GNP in the steady state, high growth rates 
and high savings rates. 

From the standpoint of models like these, Japan seems to have all the 
ingredients for high land values. Average property taxes are low: the average 
rate was just 0.39 percent in Japan in 1988, less than one-fourth the 1.73 
percent property tax rate in the United States. So it has been easy to hold land 
with low taxes and low costs of borrowing and interest charges that are tax 
deductible. The intensity of land use in Japan is 20 to 30 times that in the 
United States. Real growth rates in Japan have exceeded those in the rest of the 
world over the last few decades, and estimates of the relative growth rates by 
Data Resources in 1989 were 4.0 percent for Japan and 2.3 percent for the 
United States for the next decade. Finally, Japan has had a low rate of time 
preference, perhaps one-quarter to one-third the U.S. levels, with high savings 
despite low interest rates. 

In 1985, Japan’s land value to GNP compared to U.S. land value to GNP 
was in a ratio of 2.88: 1. Given the much smaller land area of Japan, this figure 
implies Japanese land prices in the range of 80-120 times U.S. land prices. 
From 1985 to 1987, the ratio of Japan’s land value to GNP, compared with U.S. 
land value to GNP, increased to 4.3: 1. Boone (1989) calculates that the 1985 
ratio is plausible enough, based on the underlying assumptions. The jump to 
1987 requires that Japan’s growth rate be perceived to increase about 2 percent 
faster, forever. Thus, Boone has argued that as ridiculous on the surface as the 
Japanese land prices are, they are more or less in the ballpark of a rational 
economic explanation, provided that Japanese required rates of return remain 
low and real growth remains high.* This underscores the crucial importance of 


“along with Boone (1989), Noguchi (1991), Rose (1990) and Ziemba (1991a) have estimated land 
price equations based on fundamental data. Using data on 47 prefectures from 1977-87, Noguchi 
(1991) found that the log of price in 100 yen per square meter was negatively related to the 
long-term interest rate, and positively related to the real prefecture GDP per unit of urban land, 
the population growth rate within that prefecture and neighboring prefectures, and the share of 
secondary and tertiary industries in the prefecture. Rose (1990) shows that land values and housing 
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the interest rate used by the market, which will become even more apparent in 
discussing the reason for the decline in asset prices from 1990-92. 

In short, Japanese land prices are largely explainable through standard 
economic variables. Moreover, if the government in Japan wished to reduce 
land prices, it has at its disposal many tax and regulatory instruments which 
could do so. Unless such steps are taken, however, Japan’s land prices will 
probably remain extremely high and unaffordable for most of its citizens. 
Current Japanese government economic policy is to reduce the price of the 
average accommodation to five times the average annual salary. The 1990-92 
decline has so far reduced this ratio from about nine to about seven. 


The Relationship between Land and Stock Prices 


Land and stock prices have been in a very close, positive relationship since 
1955. In fact, Figure 1 shows that land and stock rose almost the same amount 
from March 1955 to September 1992. Calculations in Ziemba (199 1a) show that 
the stock market level is most closely related to commercial land prices and the 
correlation in these biannual series up to 1988 was over 99 percent. In the 
period 1955-71, land prices increased more than stock prices, with land 
increasing 100-175 times in the six largest cities and stocks increasing about 
109 times. From 1971-89, the stock market increased about 20 times while the 
various types of land only increased about five to ten times in the six largest 
cities, and three to four times in the country as a whole. 

However, stocks have been much more volatile than land prices. For 
example, the 5 to 8 percent decline in various land prices in 1973-74, following 
the first oil crisis, is the only decline in those series before the early 1990s. 
Stocks, on the other hand, have had 29 declines of 10 percent or more from 
1949 to 1992. Figure 2 illustrates the differences in stock and land volatility, 
which is dramatic even with biannual data. 

Since many economists have little familiarity with Japanese statistics on 
land and stock prices, a few words describing the data seem appropriate. Data 


rents are positively related to population and/or per capita income, inversely related to the interest 
rate, inversely related to the supply of land and inversely related to distance from the city center. 
Rates of growth in income or population generally do not contribute significantly to the explana- 
tion of prices. Per capita real income or population, and the real interest rate or the inflation rate 
exert some influence over land value in the directions implied by theory. 

Interestingly, Rose also found, using data from 1954 to 1987, that the real interest rate (the 
“all banks average agreed interest rate on loans and discounts” net of the rate of increase in the 
CPI) and the general price inflation have a correlation of nearly — 1 (during 1969 to 1984 it was 
— 0.99) so these variables are essentially substitutes. Part of the reason for this is that Japanese 
inflation is highly dependent upon the yen-U.S. dollar exchange rate which in turn is related to the 
interest differential. 
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Figure 1 
All Land prices and the Nikkei Stock Average, 1955:1-1992:2 
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on land prices is available from the Japan Real Estate Institute on indices of 
land prices for industrial, residential, commercial, the highest-priced lots, and 
all land in the six largest cities and throughout the country from 1955 to the 
present.’ In addition, the Japan National Land Agency provides appraisal 
information on several thousand properties on an annual basis. 

The data are appraisal-based. Simple averages of samples of three sets of 
ten lots in each city form the indices. The sampling procedure separates land 
into high, medium and low grades reflecting location, social circumstances, 
yield, and so on. Lots are randomly selected from these three classes. The 
procedure produces useful but not ideal data for analysis because the indices 
tend to be lagged (largely based on dated information) and smoothed (the data 
tend to be averaged and when prices fall there is a tendency not to sell 
properties). Glaringly apparent in Figure 1 is the lagged price declines of land 
in 1992. Gyourko and Keim (1992) discuss some of the problems associated 
with real estate data. Data are available for the end of the fiscal year (March 31) 
and the half fiscal year (September 30). 

Daily transaction data of a price-weighted index of 225 large capitalized 
stocks called the Nikkei Stock Average (NSA) is available beginning when the 
stock market in Japan reopened in May 1949, following the occupation after 
World War II. This index is comparable to the Dow Jones Industrial average in 


4 The six largest cities are Tokyo, Yokohama, Nagoya, Kyoto, Osaka and Kobe. The country-wide 
indices are based on 140 cities and recent data on 225 cities. 
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Figure 2 
Rates of Return All Land and the Nikkei Stock Average, 1955:1-1992:2 
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Source: Stone and Ziemba (1992) 


its construction, with the same wide acceptance and problems. A more repre- 
sentative index of the market is the Topix, which is the value-weighted index of 
all Tokyo Stock Exchange “first section” securities, some 1225 as of August 
1992. The “first section” amounts to about 85 percent of the trading and 
market capitalization in Japan. Daily values of this index are available since its 
use began in 1968, and earlier values can be calculated from available data. The 
Topix is comparable to the S&P 500.° The NSA and Topix are highly intercor- 
related and they are both used in land and stock price relationship studies. The 
Topix is more highly capitalized because it is value-weighted, even though the 
NSA has the largest individual capitalized stocks. The Topix is also much more 
highly concentrated in banking stocks, who are the owners of most of the 
land-related debt in Japan. 

Stock prices of individual securities and land are often intertwined in the 
Japanese market. Cutts (1990) discusses the Japanese policy of borrowing on 
stocks to buy land, and the reverse. The major purchases of land during the 
1970s and 1980s were by corporations; over this time, the household sector has 
been a net seller to the non-financial corporations and other sectors. The 
corporations have had the resources to purchase land, some of which are used 
to house their employees with subsidized rent. The net purchases of land by the 
non-financial sector was 28 trillion yen during 1985-89 versus 3 trillion yen in 


®' There is extensive futures trading including substantial index arbitrage in the NSA in Singapore, 
Osaka and Chicago. In fact, the NSA futures contract has higher dollar volume than any other 
index futures contract, including the S & P 500. 
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the previous five years. The extra 25 trillion yen was about 4.4 percent of their 
567 trillion yen in holdings at the end of 1989. 

Ziemba and Schwartz (1991) discuss examples of the land component of 
major corporations including adjustments to price earnings ratios that separate 
out the hidden assets (land and cross-held stock) from the firm’s earnings. If 
one does the latter, using appropriate indices for valuation, the price-earnings 
ratios of many stocks fall dramatically. For example, Nippon Steel, Japan’s 
largest stecl company, had a price-earnings ratio of 101.5 in 1988. Without 
land it was 83.3, without buildings it was 74.3 and only 5.3 without cross 
holdings.® 

Do Japanese stock and land prices literally move together? Or is one 
leading the other? There is strong evidence from a variety of sources that stock 
price changes lead land price changes. For example, Stone and Ziemba (1990) 
found strong evidence that stock price returns led land price returns from 1972 
to 1987, rather than the reverse.” Canaway (1990) estimated the lag to be 
eleven months, which is consistent with the general character of Stone and 
Ziemba’s (1990) calculations. When we updated the calculations in Stone and 
Ziemba (1990) for our 1992 paper, including data for the period 1987 to 1992, 
there was little change in the values, and the conclusion that stock returns lead 
land returns is maintained. Hamao and Hoshi (1991) also find that excess 
returns of stocks are useful in predicting excess land returns, while excess land 
returns are not useful in predicting stock returns.” 

This sort of strong connection between stock and land prices does not seem 
to occur everywhere; for example, investigation of data for the United States 


ĉA simple model to value a Japanese corporation is to aggregate the land, securities and other assets 
net of debt, using appropriate discount factors. While inventories, receivables, bonds and equity are 
valued at close to market interest rates, the land components held directly or indirectly are priced 
by the market at 10-20 percent of their current market price and evaluated in financial statements 
at extremely low book values. The latter are often one-thirtieth of current market value. The 
market is able to discount large land holdings because it is virtually impossible to initiate a hostile 
takeover of a Japanese company (Zicmba and Schwartz, 1992). 

‘Stone and Ziemba (1990) estimated econometric models that use past quarterly stock price returns 
(measured by the Topix index) in an attempt to predict the current quarter’s overall land price 
returns, including a dynamic regression model with autoregressive terms, a Box-Jenkins modcl, 
and an autogressive conditional heteroscedastic model. The estimation period was 1972:2 to 1989:1 
and the forecast period was 1989:2 to 1990:3. During the estimation period, the models all fit well 
with coefficients having the expected signs. However, only the dynamic regression model predicted 
well out of sample. Stone and Ziemba (1992) reestimated a similar equation for the NSA, rather 
than the Topix index, to include 1990-91 data. Again, the equation predicted well in the forecast 
period which suggests that the past relationship of lagged stock market returns with current land 
returns is still valid. Based on these equations, the recent decline in stock prices should imply a 
drop of about 20 percent in Japan’s land prices during 1992 and 1993. 

5tamao and Hoshi (1991) found that land index returns and the Nikkei 225 price returns are not 
co-integrated. Two variables are co-integrated if they move together in the long run. Using data 
through 1992, Stone and Ziemba (1992) also found this lack of co-integration. Recent research 
indicates that tests of co-integration may not be very powerful (Hakkio and Rush, 1991). Further- 
more, while land and stocks have moved together in the past, as shown in Figure 1, they have 
diverged significantly at various umes, including the recent past. 
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found no significant relationship between stock and land price returns (Stone 
and Ziemba, 1990; 1992). However, there are significant relationships between 
lagged real estate stock returns and real estate returns in the United States 
(Gyourko and Keim, 1992). But before discussing possible reasons for this 
connection in Japan, we should first turn to an issue that may have occurred to 
some readers—the problem of speculative land indices. 


What About Speculative Land Prices? 


While stock prices may generally lead land prices, it has sometimes been 
argued that one class of land—speculative land—leads stock prices. Speculative 
land is the term given to land which is held for expectation of gain or as a 
hedge against inflation; it is contrasted with land that is an essential input for 
business operations or housing. In Japan, speculative land is typically highly 
levered, while essential use land is not levered. As one example, in early 1988 
The Economist magazine hypothesized that golf course membership prices gen- 
erally lead the stock market, based largely on the fact that golf courses fell in 
the quarter prior to the October 1987 stock market crash. 

Golf course membership indices for various cities and areas of Japan, 
which are compiled by the Nihon Keizai Shimbun, Inc. (Nikkei), provide one 
measure of speculative land values. The value of all golf courses in Japan at the 
end of 1989 was about $500 billion, a value double the Australian or Swiss stock 
exchanges. There is a strong correlation between golf course membership 
prices and other speculative land prices, such as condominiums, but better data 
is available on golf course memberships. These indices, computed weekly since 
the end of 1981, are based on actual sales of memberships at the more than 400 
golf courses in Japan. The index is updated at the end of each week by Nikkei 
from interviews with the six major golf membership dealers who deal in the top 
400 clubs. The brokers supply the current bid and ask prices of each club and 
the index is the arithmetic average of these.” Membership in a golf course in 
Japan allows play by the member at nominal fees and the ability to bring paying 
guests as well as a share in the course including the land. Much social and 
business activity is conducted at golf courses where the ability to be in a less 
crowded environment is highly valued. 

The golf course membership index in Tokyo and the stock market gener- 
ally move in tandem, since both of these indices reflect economic activity which 
is centered in Tokyo. However, there have been two recent cases where golf 
course membership prices rose far above the NSA, before the gap was closed. 
As shown in Figure 3, the first occurred in 1986-88, thus occurring both before 
and after the October 1987 stock market crash. In 1986, golf course prices 
moved above stock prices, which in turn moved above land prices. Golf course 


9 i 3 ae Se eit, : . he St 
When there are no bid or ask prices, statistical substitution is used to adjust the index to maintain 
continuity. 
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Figure 3 
Quarterly Tokyo Golf Course Membership Prices and the Nikkei Stock 
Average, 1982-1992. 


NSA led Golf 
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Source: Stone and Ziemba (1992) 


membership prices started falling in the quarter prior to the October 1987 
world-wide stock market crash and by early 1989 stocks and golf course prices 
were in balance again. Then in January 1990, stocks began to fall in reaction to 
higher interest rates, but golf memberships and small capitalized stocks did not 
fall until March 1990 and all land in September 1990. The prices in late 1992 of 
both the NSA and Tokyo golf courses seem to be consistent with Japan’s 
economic growth in the 1980s with the excesses of the 1985 to 1989 period 
eliminated. 

Based on these episodes, it is difficult to conclude that prices for golf 
memberships, or speculative land more generally, lead stock prices. Stone and 
Ziemba (1990, 1992) offer formal confirmation of this result. However, they do 
find evidence that golf courses lead land price returns more generally, and that 
stock price returns lead land price returns, even when land is defined to 
include golf course membership returns. The evidence is that a move in stock 
prices impacts golf course prices in about three months and land in nine to 
twelve months. 


Has Japan Experienced a Speculative Bubble 
in Land and Stock Prices? 


Japanese stock prices fell over 60 percent from their peak at the end of 
1989 to their valley in August 1992. Meanwhile, speculative land prices (as 
measured by the golf membership index) fell over 50 percent by August 1992 
and by over 60 percent in early 1993. While these declines are enormous, they 
don’t quite measure up to some of the other spectacular collapses in financial 
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market history. For example, the U.S. stock market in the Great Depression 
declined by 87 percent; the Hong Kong stock market fell by 92 percent in the 
early 1970s; the Mexican stock market had fallen by 78 percent in 1981; and 
Taiwanese stocks had fallen by 80 percent in 1990. To take an even older 
example, the “tulipmania” experience in Holland led to a 93 percent decline in 
value in 1637. 

Despite the dramatic declines in stock prices and speculative land, the fall 
in essential land prices from 1990 to 1992 has been only about 15~20 percent. 
This fall is nonetheless striking, because Japanese land index values had not 
previously fallen in the postwar period, except in the aftermath of the 1973-74 
oil crisis. The latter decline was very small, despite a major restructuring of the 
Japanese economy and a 37.4 percent drop in the NSA. Prices began rising 
again after six months. Japanese land did not decline in the second oil crisis, 
which did not have the shock impact of the first oil crisis. Moreover, remember- 
ing that land and stock prices have tended to move together, with stock prices a 
bit in front, it would seem that land prices remain extremely high. 

In the end, can the level of Japanese land and stock markets, or the sharp 
movements of those markets in the 1980s, be explained by fundamental factors, 
like interest rate movements? Were they explainable for a time, and then no 
longer explainable? Let us first state the case for explaining these price 
movements with fundamental factors, and then the case for believing that some 
sort of speculative bubble occurred. 


Evidence on Fundamental Value 

Japanese land prices were often 100 times greater than those in the United 
States. Differences of that magnitude are consistent with economic rationality, 
assuming that the intensity of land use in Japan is 20-30 times higher and the 
required rate of return on rents is about one-fourth to one-third as high. Both 
of those conditions were consistent with the data up to 1989. Boone’s (1989) 
analysis is also consistent with the 1990-92 decline with an upward adjustment 
of the required rate of return at a time of increased interest rates. Liu and Mei 
(1991) also find support that stocks and real estate prices are based on 
fundamental values. Ziemba and Schwartz (1991) and Ziemba (1991b, 1993) 
find that Japanese stock prices react to similar fundamental and seasonal 
factors as U.S. stocks, with earnings growth the most important variable for 
stock price changes. Gampbell and Hamao (1992) show that the dividend price 
ratio and interest rate variables are related to excess returns in Japan. 

Not only are the levels of asset prices consistent with economic rationality, 
but the boom and bust cycle of the 1980s and 1990s can also be explained by 
rational factors—primarily movements in the short and long-term interest 
rates. Figure 4 shows the short-term interest rates in Japan from June 1984 to 
August 1992. The dramatic decline in rates in 1985-86 following the Plaza 
accord in September 1985 in the wake of the substantial appreciation of the yen 
and the expansion of the monetary supply coincided with the more than 
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Figure 4 
Short-Term Interest Rates in Japan, June 18, 1984—August 12, 1992 
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doubling of land prices and the sharp rise in stock prices from 1985 to 1987. 
Interest rates then stayed low from 1987 to early 1989. Then the Bank of 
Japan’s tight money policy in 1989 and 1990 led to successive increases in the 
discount rate from 2.5 percent to 6.0 percent, which presaged the decline in 
stock and speculative land prices. Even cheaper money was available for large 
corporations in the late 1980s through equity warrant bonds, which are traded 
mainly in London.'® The 1980s were a period of cheap and easily available 
money. In contrast, the 1990s have been a period of expensive and hard to 


With currency hedging, the bonds with these detachable warrants had net cost of borrowing of 
—2 percent to +2 percent. The basic warrant bond was an ingenious product of the 1980s. A 
three-year bond with a detachable warrant provides an interest rate of, say, 4 percent versus the 
market rate of 8 percent in U.S. funds because of the value of the warrant. Currency hedging the 
bonds proceeds in dollars back into yen with a yen interest rate of (say) 3 percent below that of the 
dollar provides a net cost of 1 percent for the money in yen because the future yen value must be at 
about a 3 percent premium per year to the U.S. dollar to avoid riskless arbitrage. In addition, when 
the warrants are exercised, the firm receives a substantial cash infusion in exchange for a slight 
dilution of the equity. Higher yen interest rates versus the dollar and a weak stock market (no 
longer receptive of additional supply such as these warrants) essentially halted this market in 1990. 
Recent issues are largely in marks or Swiss francs which have high interest rates. Kuwahara and 
Marsh (1992) discuss this market and the pricing of these warrants. There are some $100 billion 
plus of these warrants that will expire worthless if the stock market does not rise considerably in 
1993-94. 
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arrange financing. The low short-term interest rates shown in Figure 4 do not 
reflect the true cost of most borrowings in 1992. For example, ten-year 
government bonds whose rates were slightly higher than the short-term rates 
in 1987-89 rose to a peak in August 1990 slightly below the short-term rates 
and then declined in 1992, but at a slower rate than the short-term rates. In 
late 1992, these rates were about 150 basis points higher than the short-term 
rates. 

An alternative way to test whether fundamental values can explain stock 
prices is to look at price /earnings ratios. While the NSA was trading in the late 
1980s at P/E ratios significantly higher than those in the United States, much 
of the difference can be explained by alternative accounting, timing and related 
institutional factors, as shown by Aron (1981, 1989) and French and Poterba 
(1991). Paul Aron (1989) has computed adjusted Japanese price-earnings ratios 
on a continuing basis since 1981. He found that the Japanese P/E ratios have 
been roughly comparable to those in the United States in the 1980s, when one 
adjusts for the differing required rates of return in the two countries. French 
and Poterba (1991) arrive at a similar conclusion, using a somewhat different 
approach.''! However, since calculations like these were completed, interest 
rates have declined substantially in the U.S. and the U.S. stock market has 
risen. Short-term interest rates in Japan in August 1992 were at levels as low as 
in 1986. It is no longer clear that price-earnings ratios continue to reflect only 
the difference in required rates of return between the U.S. and Japan. 

Calculations in Ziemba and Schwartz (1991) argue that once the required 
rates of return in the two countries are adjusted for current interest rates, the 
market in December 1989 was greatly overpriced. This model also accurately 
reflects the rises and declines in Japanese stocks during 1990-91. However, it 
may be that investors use lagged- or levered-interest rates in their calculations, 
rather than current rates, or that investors are not fully rational and partially 
base their decisions on investor sentiment, as in the noisy trader hypothesis of 
Shleifer and Summers (1990). That point of view is consistent with the finding 
that overpriced stocks were followed immediately by two sharp declines during 
1990. Further evidence in support of this view was the continual decline in the 
equity indices in 1991-92 while interest rates were in a substantial decline. 


Speculative Bubble Evidence 

How does one test for a speculative bubble, or even think about such an 
event? One common way of making the distinction is to say that if price levels 
and movements can be explained by fundamental factors, then no bubble 
exists. Conversely, following Stiglitz (1990), we assume that a bubble exists if 
the reason that the price is high today is only because investors believe that the 
selling price will be higher tomorrow when fundamental factors do not seem to 


"French and Poterba (1991) focus on the 1986 doubling of price-earnings ratios. ‘They argue that 
accounting differences can explain about half the long-run differences in United States and 
Japanese price earnings ratios but not the 1986 doubling. 
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justify such a price. While in principle straightforward, it is difficult in practice 
to separate the bubble from the changes in fundamentals in a particular asset 
market. An examination of econometric work concerned with bubbles versus 
fundamentals by Flood and Hodrick (1990) concluded that no study has yet 
managed to solve the problem of separating the bubble movements from the 
possibility that the underlying fundamental problem is misspecified. 

With these warnings in mind, Ueda (1990) argued that stock prices under- 
esumated the value of the corporate assets during 1970-83, but overestimated 
the value of these assets since 1983. The extreme increase in stock prices in the 
1980s occurred during a period of declining required risk premiums in relation 
to rates of return. Hence, the sharp increase in prices of stocks prior to their 
steep decline in 1990 was at least partially based on expectation of future 
increases in asset prices of land and stock shares held. French and Poterba 
(1991) present similar analyses and conclusions. 

Japanese land in 1991 was worth about $20 trillion dollars, over 20 percent 
of the world’s wealth. From the mid-1950s to the mid-1980s, Japanese land 
values increased 11 times as much as rents. Rose (1990) also found that land 
prices increased more rapidly than income, whereas housing rents increased 
less rapidly then income in this period. Noguchi (1991) found that land prices 
are far in excess of the discounted sum of rents and pricing equations. 

An alternative approach which lends some insight is to investigate the 
distribution of prices. When Rachev and Ziemba (1992) did this for 
the distribution of the golf course membership prices, for example, the result- 
ing distribution had very fat tails compared to typical distributions for U.S. 
stock prices (Fama and Roll, 1971; Akgiray and Booth, 1988) and Japanese 
stock prices and was not normally distributed.'!? The continual decline of golf 
course prices during 199] and early 1992 had a marginal effect on the 
parameter estimates. The fat tails means that the probability in any short time 
interval of a large increase or decrease is very high. 

We have no evidence that recent Japanese stock prices have distributions 
that are consistent with the bubble hypothesis. However, this has not been 
adequately studied; one study we are aware of is Tse (1991), who used data 
prior to the 1990-92 stock market decline and excluding the October 1987 
crash, and found no evidence of a bubble. Rachev and Ziemba (1992) using 
data from 1983~92 found Japanese stock prices do fit a stable distribution well 
and have tails with mass intermediate between U.S. stocks and Japanese golf 
courses. But perhaps this uncertainty about a bubble in the stock market is to 
be expected, since current studies are not even able to agree whether an event 
like the October 1987 crash of the U.S. stock market was a speculative bubble. 
To cite a few examples, Froot and Obstfeld (1991), Hardouvelis (1988), and 
Miller (1990) argue that there was a speculative bubble, while Dezhbakhsh and 
Demirguc-Kunt (1990) argue that prices followed a random walk based on 
current information. 


'2Mittnik and Rachev (1993) survey the modeling of asset returns with stable distributions. 
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Overall, we would say that there is some evidence for a bubble in specula- 
tive land prices, as measured by the golf course membership index. Remember, 
purchases of such speculative land were highly levered. On the other side, the 
distribution of price changes seems to offer little evidence (at least so far) for a 
bubble in the market for essential land or the stock market. 


Final Remarks 


The decline in Japanese financial markets has put intense financial pres- 
sure on the banks and other financial institutions. In particular, the banks must 
satisfy the Bank of International Settlements regulations in 1993 (Shibota, 
1991). This caused the government to provide a major fiscal policy stimulus in 
August 1992. This, along with very low interest rates, would seem to presage a 
change in investor sentiment in Japan. The economic evidence seems to point 
to lower land-prices in the near-term—about 20 percent according to the 
regression models—but perhaps some recovery in the stock market (whose 
market capitalization has fallen well below its worldwide share of economic 
activity), thus gradually bringing the value of these two assets back toward their 
historical congruence. 

As to the question of whether a speculative bubble actually occurred or not, 
the answer seems to involve rather subtle matters of definition. Low interest 
rates in the mid-1980s, combined with the interrelationship between stock and 
land markets, clearly helped to trigger a boom. However, it appears likely that 
the boom went somewhat beyond what could be justified based on fundamental 
factors. On the downside, then, Japanese government policy formulated in 
1988 and implemented beginning in 1989 was to deflate the so-called “bubble 
economy” through much higher interest rates (Flack, 1990; Shale, 1991). These 
rate increases triggered the initial decline in asset prices. But as more and more 
selling occurred, there was a strong multiplier effect with individuals and 
corporations that were highly levered in stocks and land being forced into more 
financial trouble because of the decline in the market thus leading to their 
forced selling. This effect was strong enough that even though short-term 
interest rates had returned by 1992 to a level lower than those at any time in 
the mid to late 1980s, and long-term rates were as low as they had previously 
been, asset prices remained low. A factor in this is the availability of credit. It is 
simply much more difficult in 1992-93 to arrange financing than it was in the 
late 1980s. 

On one hand, it would seem something of an artificial distinction to say 
that movements in interest rates are a fundamental factor, but the cycle of 
leverage these movements unleashed are a speculative bubble. On the other 
hand, if the simple definition of a bubble is when investors are buying and 
selling only on the basis that prices will rise further, than surely using the high 
price of previously purchased assets as collateral for loans to buy still more 
assets should qualify as a bubble. 
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Chapter 10 


THE CHICKEN OR THE EGG: 
LAND AND STOCK PRICES IN JAPAN" 


William T. Ziemba 
Faculty of Commerce, University of British Columbia, Vancouver, B.C. V6T 1Y8 


Tsukamoto Sozan Building in Ginza 2-Chome in central Tokyo is built on 
the most expensive land in the country with one square meter priced at 
¥37.7 million or about $279,000 U.S. at the (December 1990) exchange rate 
of about ¥135 per U.S. dollar. 


Abstract 


It is well known that land prices in Japan are the world's highest and that the 
stock market has increased dramatically. This paper explores the relationship 
between the price levels of these two markets. The results indicate an 
extraordinarily close relationship, particularly for commercial land. While the 
prices of land and stocks are highly related, the evidence is that stock prices lead 
land prices and not the reverse. 


Land Prices in Japan are Astronomical 


Some 120 million people live in Japan in an area of about 377,800 km? which 
is about the size of Montana. Most of the land is mountainous or is used for 
agriculture. Indeed, less than 5% of the land is used to house all the people. A 
breakdown of land use appears in Table 1. Hence, with high incomes, crowded 
conditions, and an intense desire to invest at home, land prices in the most 
desirable locations have escalated beyond belief. Almost 30 million people, or a 
quarter of the population, lives in the greater Tokyo area. Much of the housing 
in Japan's major cities is owned by the large corporations and their employees 
receive subsidized rent. However, fully 60% of families in Japan, and 55% in 


* Portions of this research were conducted under William T. Ziemba’s direction at the Yamaichi 
Research Institute in Tokyo in 1988/89. Without implicating them I would like to thank my 
colleagues there, particularly Hirokazu Yuihama and Hitoshi Ishiyama for their help. Thanks 
are also duc to Mr. Motohiko Higashikawa of the Japan Real Estate Institute for supplying data to 
me. This research was also partially supported by the Centre for International Business Studics, 
Univeristy of British Columbia and the Social Sciences and Humanities Research Council of 
Canada. 
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Tokyo, own their own homes and many individuals invest in raw land or in 
apartments that they rent to others. 


Table 1:_Land Use in Japan (1986) 
Agriculture 14.5 


Woodlands 66.9 


Moors 0.8 
Rivers 3.5 
Roads 2.9 
Dwellings 4.0 
Other 7.4 
Total 100.0 


Total Area 377.8 (1000 km?) 
Source: National Land Agency, Japan 


Table 2 and figure 1 give the Japan Real Estate Institute's land indices for the 
six largest cities, and for all of Japan for commercial, housing, industrial and total 
land for each six month period from March 1949 to September 1990. The indices 
are for March (1) and September (2). Figure 1 also gives the yearly rate of changes 
as of March of each year. The six largest cities are Tokyo, Osaka, Nagoya, 
Yokohama, Kobe and Kyoto. The country wide indices are based on 140 cities. 
The data are appraisal based which tends to smooth the price levels and lag the 
market. Simple averages of samples of ten lots in each city form the indices 
which were normalized at 100 as of March 31, 1980. The sampling procedure 
separates land into high, medium and low grades reflecting location, social 
circumstances, yield, etc. The sampling procedure selects lots randomly and 
equally from each of these three classes. 

Table 2 also indicates that the price increase has been largest in the six largest 
cities. Despite large recent rises, the relative gain in the period 1955 to 1970 was 
much larger than from 1970 to 1990. For land in the whole country, the 1955 to 
1970 period produced gains of about 15 times 1955 values. These prices then 
increased only about four fold in the ensuing twenty years. In the six largest 
cities, the increase was also much larger in the 1955 to 1970 period versus the past 
two decades. 
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Table 2: Land Indices in Japan 1955-1990 


Nationwide Six Largest Cities Consumer 
Al Com- Housing Indus- All Com- Housing Indus- Price 
mercial trial mercial trial Index 
1955:1 3:1 3.7 2.5 3:2 2.4 4.9 1.7 22 
1955:2 3.3 4.0 2.7 3.4 2.6 5.1 1.9 2.3 
1956:1 3.5 4.3 2.9 3.7 2.8 5.6 1.9 2.6 
1956:2 3.9 4.8 3.2 4.1 3.2 6.1 2:2 3.0 
1957:1 4.5 5.5 3.6 4.8 3.6 6.7 2.6 3.5 
1957:2 5.0 6.1 4.0 5.4 4.2 7.4 2.9 4.1 
1958:1 5.5 6.6 45 5.9 4.6 7.8 3.3 4.7 
1958:2 6.1 7.2 5.0 6.5 5.0 7.9 3.6 5.3 
1959:1 6.8 8.1 5.5 7.3 5.5 8.4 4.0 5.9 
1959:2 7.7 9.1 6.2 8.3 6.3 9.6 4.6 6.8 
1960:1 8.7 10.5 6.8 9.5 7.2 11.2 5.2 7.9 
1960:2 10.2 12.4 7.8 11.4 9.3 14.1 6.0 10.6 
1961:1 12:3 14.4 9.3 14.6 12.1 18.0 7.5 14.8 
1961:2 14.5 16.7 10.8 17.4 15.7 22.9 9.5 20.0 
1962:1 15.7 17.9 11.8 19.1 17.3 24.3 10.5 22.2 
1962:2 17.1 19.2 12.8 21.0 19.0 25.4 11.9 24.2 
1963:1 18.4 21.0 13.6 22.7 20.5 27.1 13.1 26.1 
1963.2 19.6 22.1 14.5 24.5 22.3 29.0 14.5 28.4 
1964:1 21.0 23.5 15.5 26.3 24.1 31.1 15.9 30.4 
1964:2 22.5 25.4 16.6 28.0 25.6 32.9 17.1 32.0 
1965:1 23.8 26.6 17.8 29.6 26.4 33.8 17.8 33.1 
1965:2 24.4 27.5 18.2 30.1 26.6 34.1 18.0 33.2 
1966:1 25.0 28.4 18.9 30.5 26.9 34.5 18.4 33.2 
1966:2 25.8 29.5 19.6 30.8 27.2 34.8 18.9 33.2 
1967:1 27.1 31.4 20.8 31.9 28.1 36.3 19.6 34.0 
1967:2 28.8 33.3 22.4 33.2 29.2 37.6 20.7 35.0 
1968:1 30.8 35.7 24.2 35.1 30.5 38.8 22.0 36.3 
1968:2 33.2 38.6 26.3 37.4 32.4 41.1 23.9 38.1 
1969:1 36.1 41.8 29.0 40.2 35.1 44.4 26.1 40.9 
1969:2 39.6 46.1 32.2 43.2 38.1 48.0 28.7 44.1 
1970:1 43.2 49.9 35.5 47.0 41.3 51.4 31.3 47.8 
1970:2 46.8 53.8 38.7 50.7 44.8 54.1 34.4 52.0 
1971:1 50.0 57.0 41.8 54.1 48.0 56.5 37.2 55.8 
1971:2 53.2 59.8 44.9 57.8 51.0 58.8 40.0 59.4 
1972:1 56.5 63.2 47.8 61.8 54.1 61.7 42.8 62.7 
1972:2 61.2 67.8 52.1 66.7 59.6 66.7 48.1 68.6 
1973:1 70.8 76.6 61.6 77.0 71.2 77.0 59.2 80.8 
1973:2 81.1 85.9 71.8 88.5 81.1 85.8 68.5 91:7 
1974:1 87.0 91.4 77.8 94.8 84.1 88.7 71.0 95.3 
1974:2 88.2 92.4 79.1 95.7 84.5 89.1 71.5 95.4 
1975:1 83.3 87.8 74.6 89.7 77.3 82.1 65.6 86.7 100.0 
1975:2 83.5 88.0 75.0 89.7 77.5 82.1 66.2 86.8 
1976:1 83.9 88.3 75.7 89.9 78.0 82.4 66.9 86.8 108.3 
1976:2 84.7 88.7 77.0 90.2 78.8 82.8 68.3 87.1 
1977:1 85.7 89.3 78.7 90.7 79.9 83.9 69.9 87.4 118.1 
1977:2 86.9 90.0 80.5 91.1 80.9 84.5 71.6 87.8 
1978:1 88.1 91.0 82.5 91.8 82.2 85.7 73.6 88.4 112.6 
1978:2 89.8 92.2 85.1 92.9 84.5 87.4 77.2 89.7 
1979:1 92.1 93.9 88.7 94.4 88.3 90.3 82.9 92.0 127.0 


1979:2 95.5 96.5 93.7 96.8 94.2 95.3 91.6 96.0 
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Table 2. Land Indices in Japan 1955-1990 (continued) 


Nationwide Six Largest Cities 
Consumer 
All Com- Housing Indus- All Com- Housing Indus- Price 
mercial trial mercial trial Index 


1980:1 100.0 100.0 100.0 100.0 100.0 1000 100.0 100.0 137.2 
1980:2 104.6 103.7 1065 103.5 1046 1043 106.2 103.4 
1981:1 108.7 106.9 112.2 106.7 108.5 108.1 110.6 106.7 143.7 
1981:2 112.8 110.3 117.6 109.9 1124 1125 114.8 109.9 
1982:1 1164 1134 1224 1126 115.7 116.7 117.9 112.6 147.6 
1982:2 119.4 1160 1264 115.0 118.5 1206 120.2 114.8 
1983:1 121.9 1183 1295 117.0 121.3 124.6 1225 116.7 150.5 
1983:2 124.0 120.2 131.9 118.9 123.9 1285 1246 118.6 
1984:1 125.8 122.0 134.1 1203 127.6 135.7 1268 1206 153.8 
1984:2 127.6 123.8 1360 121.8 132.2 1444 1296 123.1 
1985:1 129.3 125.7 137.7 123.2 137.1 153.6 133.8 1250 157.0 
1985:2 131.0 127.8 139.2 124.5 143.3 167.1 138.1 127.1 
1986:1 133.0 130.9 140.7 125.8 156.6 197.9 146.7 131.2 158.1 
1986:2 135.6 134.9 1427 1273 173.8 2293 165.3 137.0 
198721 140.2 141.1 147.0 130.1 197.2 264.7 1863 1536 158.3 
1987:2 149.9 153.3 156.1 137.0 234.8 336.7 216.1 175.4 
1988:1 154.2 159.9 159.3 140.1 252.2 375.2 2294 183.2 159.3 
1988:2 159.2 166.9 162.7 144.7 279.9 420.0 240.0 213.1 
1989:1 165.9 175.9 168.1 150.1 318.8 467.5 264.5 243.7 163.4 
1989:2 174.8 1868 176.0 157.7 356.6 528.8 300.7 279.4 
1990:1 189.3 203.9 189.6 170.2 408.2 599.2 253.0 315.7 
1990:2 203.1 220.55 202.9 1814 429.1 625.9 372.4 331.7 


# times increase in ¥ 

1955:1 to 1990:2 65.5 59.6 81.2 56.7 178.8 127.7 219.1 150.8 
1955:1 to 1970:2 15.1 14.5 15.5 15.8 18.7 11.0 20.2 23.6 
1970:2 to 1990:2 4.3 4.1 5.2 3.6 96 11.6 10.8 6.4 


Land values in the six largest cities have outpaced the CPI by twenty times 
since 1955. In the Ginza district of Tokyo each square meter of land is worth over 
$200,000 U.S. Choice downtown land in Tokyo goes for the equivalent of nearly a 
billion dollars an acre. At neighboring land prices, the value of land under the 
Emperor's palace and garden in Tokyo equals that of all California or of Canada. 
The golf courses of Japan alone are worth more then the entire Australian stock 
market, some A$250. The total land value in Japan in 1990 was about 4.1 times 
that of the whole United States. Japanese land was worth some ¥2180 trillion as 
of the end of 1989. This compares with a value of ¥1050 trillion at the end of 1985. 
Using an exchange rate of ¥143.76 per dollar at the end of 1989, gives a land value 
of $15.16 trillion. As of September 1990, all land had an index of 203.1, up 16.2% 
from September 1989. With an end of 1990 exchange rate of 135.40, total Japanese 
land values were in the $18.7 trillion range in late 1990. The average acre of land 
in Japan is worth fully 100 times the average acre in the U.S. So even though the 
U.S. has about 25 times more land than Japan, its current total value is less than a 
fourth as much. Essentially half the world's land value at 1987-90 prices is 
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accounted for by Japanese land! It also accounts for about 20% of the total asset 
value in the world. Simple houses in Tokyo rent for more than $10,000 per 
month and cost in the millions. Office space for sale in Tokyo's financial district 
costs nearly $75,000 per square foot. Some luxury apartments in Tokyo rent for 
well over $20,000 per month. Figure 2 compares land prices throughout Japan, in 
1988 with Tokyo normalized at 100. Osaka was then 62, Nagoya, 27, and most 
other metropolitan areas in the 12 to 18 range. Figure 3 compares land value by 
region from 1983 to 1987. 


Figure 1: Land price indices for industrial, residential, commercial and all land 
and annual rates of price change for all land, 1955 to 1990 


a. In the six largest cities 
(semi log scale) 
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b. In the entire country (140 cities) 
(semi log scale) 
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Source: Japan Real Estate Institute 


In 1988 Tokyo's land value alone was about $7.7 trillion, or about half the land 
value of the whole country. To understand how much this is we can do a 
idealized experiment. Let's borrow on it up to 80% of its value. Banks in Tokyo 
commonly provided such loans based on land security until the high interest 
rates of 1990. From 1987 to 1989, the interest rates on loans secured by land were 
5.7% and 6.6% for variable and fixed rate loans, respectively. We would then 
have almost enough money to purchase all the land in the U.S. for $3.7 trillion 
and all the stock on the New York, American and NASDAQ over-the-counter stock 
exchanges for about $2.6 trillion in an all-cash transaction. Obviously, one could 
not sell all of Tokyo's land for $7.7 trillion quickly, nor would a group of banks 
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undertake such a large loan, but this was the value of land prices in fiscal 1987. In 
Tokyo about 2% of land changes hand each year. The price is kept up and bid 
higher because of the excess of demand over supply. 


Figure 2: Housing Land Price Index 
Tokyo = 100 (¥507,300/ m2), Osaka=62; Nagoya=27, Other Areas=12-18 
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Figure 3: Land Value by Region in Japan 
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Source: Economic Planning Agency reported in Canaway (1990) 


Figure 4 shows that as of 1985 a staggering 56% of the national wealth of Japan 
was land. The current percentage may be even higher since there was a huge 
price increase in 1986 and steady rises since then. 

Land turnover is very small as the Japanese believe in holding land whenever 
possible this is reinforced by the tax system which encourages the purchase of 
more land and discourages land sales. The population in per unit of habitable 
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area is thirty times higher in Japan than the U.S. The GNP and energy 
consumption per habitable area are also much much higher in Japan than in the 
U.S. (though the energy per unit GNP is much lower in Japan), see Table 3. This 
puts upward pressure on land prices. 


Figure 4: Composition of National Wealth in Selected Countries 
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Source: Economic Planning Agency, Japan 


Table 3: Comparison of Fundamentals, Japan and the U.S., 1989 


Japan as 
Japan U.S. % of U.S. 
Population, millions 120 239 50.21 
Total area (1000 sq km) 377 9373 4.02 
Habitable area (1000 sq km) 80 4786 1.67 
Population per habitable 1500 50 3000.00 
area (pop/sq km) 
GNP per habitable area 16.90 0.80 2112.50 
(million $/sq km) 
Energy consumption (tons 4650 390 1192.38 


oil equivalent/ sq km) 


Source: Daiwa Securities America, Inc 


If there is to be a major stock market crash in Japan it may well start with or be 
linked to land values. We study the link between the stock market and land 
prices in this paper. Central Tokyo land price increases were relatively firm in 
1988-1990 but large increases continued in the suburbs and in other cities. Will 
property prices crash at some later date? It is hard to say, but with the bulk of the 
property controlled by the major companies and the government, with the tax 
structure and the incentives favoring land holding, and with buying by both 
individuals and institutions, the prices may well stay at these lofty levels. 
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Boone (1989) developed several models in an attempt to rationalize the high 
land values in Japan from an economic point of view. He found that if Japan's 
GNP growth exceeds that in the U.S. by about 2% per year forever, then land 
prices 100 times higher in Japan than in the U.S. are consistent with the economic 
model. He also developed a simple model for the relative land price in Japan 
versus that in the U.S.: 


. hee Land value Rents GNP 
Relative land price in Japan = Rents x GNP X Tand 
1 
= Fi x 1 x b, 


where a is essentially the ratio of rents in Japan for comparable properties which 
are known to be 0.25 to 0.50 in the U.S., b is the GNP to land estimate which 
ranges from 20-30 (Table 2 estimates this at 21 in 1989), and rents/GNP= 1 is 
consistent with the Cobb-Douglas production model that Boone assumes. Using 
these a's and b's gives relative land prices close to the actual ratio of 100. 


JAPANESE HOUSEHOLDS 

The vast majority of the wealth of Japanese households is contained in their 
land and buildings. This constitutes nearly two thirds of their assets. Various 
savings deposits amount to about 14%. Rates of return on these savings accounts 
are regulated, change infrequently and have been low - about 4% for the best 
investments- but this income is usually not taxed. Deregulation in 1989 and 1990 
has made available acounts paying 7% or more, where the income is taxable. 
Securities, insurance and pension assets are only about 6.9% of the Japanese 
wealth and the proportion of this that is corporate stock is astonishingly low, 
some 0.3%. Government bonds and bank debentures each accounts for 0.9%, so 
the largest bulk of the savings goes into cash savings instruments. This is 
partially explained by the practice of financing Japanese corporate needs with bank 
loans rather than equity. Household assets are growing at about $1.83 billion U.S. 
per day (see Ziemba and Schwartz, 1991b). The excess is going mostly into savings 
deposits much of which is then made available largely to corporations for loans to 
purchase assets in the stock market, land, bonds, and overseas investments. 

Households in the U.S., in contrast, have their assets concentrated in time 
deposits (23.2%), insurance and pension funds (13.1%) and securities (19.7%) 
along with residential (23.4%), with land comprising only 7.1% of total assets. 
Table 4 details the differences in the household shares in Japan and the U.S. using 
averages for 1974-83. Consumer durables are one third as widespread in Japan 
compared to U.S. households, which is not surprising considering the consumer 
oriented nature of the U.S. economy. Space is a major factor in this. In Japan 
there simply is little space to store things, hence much less is purchased and 
stored. Moreover when new items are purchased old ones are trashed or moved 
out elsewhere. 
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Table 4: Portfolio Shares of Japanese and U.S. Households, Averages for 1974-83 


Japan U.S. 
Cash 0.8 na 
Demand Deposits 1.2 3.4 
Time Deposits 7.9 23.2 
Postal Saving 4.9 - 
Trust 1.8 - 
Insurance and Pensions 3.8 13.1 
Securities 3.1 19.7 
Residential Structures 11.6 23.4 
Land 53.4 71 
Consumer Durables 4.0 11.9 
Noncorporate Structures 
equipment & invent 7.3 6.1 


Source: compiled in Noland (1988) 


Figure 5 shows the volume of real estate loans. Of particular interest is its 
variability and predictive power for leveling off periods. Rents shown in figure 6 
are rising slower than land prices. Hence one expects increases in valuation to 
provide profits. That is, the short-run losses which are tax deductible off income 
will be made up by long-term capital gains which are taxed at lower rates. For 
additional discussion on these land prices issues, see Boone (1989), Canaway 
(1990), Cults (1990), Fingleton (1990) and Flack (1990). 

High interest rates which led to a sharp fall in stock prices in 1990 have not led 
to any decline in land prices as shown in Table 2. However, there was a sharp 
decline in speculative land such as golf course membership and condos, see 
Figure 13 below and Stone and Ziemba (1990). As interest rates rise, land demand 
falls but in Tokyo, with virtually no new supply, demand still greatly exceeds 
supply. At the same time supply declines with higher interest rates as 
development costs are curtailed. All the incentives favor holding land and not 
even developing it. As Canaway (1990) has pointed out, land held less than five 
years is taxed at fully 52% of its sale value. Meanwhile, yearly taxes paid to hold 
land are about 0.05 to 0.10% of current value. Even upon death it pays to borrow 
money which is deductible at full value while land is valued at about half its 
market value. Hence inheritance taxes are minimized. Canaway argues that in a 
major crash the stock market will go first, then the economy and finally the land 
markets. The results in the paper are consistent with this view. 
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Figure 5: Real-Estate Loans in Japan 
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Figure 6: Residential Land Prices vs Rents 
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Source: Japan Real Estate Research Institute, Management and Coordination Agency 


LAND AND THE NSA CORRELATIONS 
The growth in the Nikkei stock average and its declines are discussed by Schwartz 
and Ziemba in the preceeding article in this volume. 

This paper uses the NSA to study the relationship between the levels of stock 
and land prices in Japan. The NSA is the most popular index and presumably 
reflects investor sentiment in the economy, so it might relate to land prices at 
least as well as the Topix. Since the Topix and NSA have a very close correlation, 
one would not expect the results to be much different anyway. For more details 
and results on the Japanese stock market, see Elton and Gruber (1989), and Ziemba 
and Schwartz (1991a), and the papers in this volume. 

The biannual data from 1955 to 1988, yields the following correlation matrix 
for the NSA, and land price indices for all land, all commerical land, housing 
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land, industrial use land throughout Japan and these latter four indices for the six 
largest cities. 


Table 5: Contemporaneous correlations between the NSA and various land 
indices. 


Nikkei Land Indices Throughout Japan Indices in argest Cities 
Dow ALL COML HOUSE INDL ALL COML HOUSE INDL 
1 2 3 4 5 6 7 8 9 

1.000 
0.858 1.000 


0.857 0.998 1.000 

0.874 0.995 0.989 1.000 

0.820 0.996 0.997 0.984 1.000 

0.963 0.958 0.960 0.961 0.938 1.000 

0.988 0.879 0.885 0.885 0.850 0.978 1.000 

0.956 0.965 0.962 0.974 0.942 0.996 0.964 1.000 

0.889 0.991 0.995 0.981 0.987 0.978 0.919 0.975 1.000 


OANA ALEWN e 


All the correlations of land with the NSA are high. The lowest is 0.82 for 
industrial land throughout Japan. The six largest cities have the highest land 
prices and their land values correlate very closely with the NSA. Indeed all land 
is at 0.963, housing at 0.956 and commerical land at 0.988. Industrial land, much 
of which is lower priced large acreage away from the major cities, correlates less 
with the NSA. 

The concern is with causality between land and the NSA, so we record the lag 
behind and future correlations. The six month prior lag behind index values for 
the land versus the current NSA are shown in Table 6. 


Table 6: Correlations between current NSA values and lagged behind land values 


. ice: ro t Ja Land Indices in 6 Largest Cities 
Current Six Months Before Six Months Before 
NSA ALL COML HOUSE INDL ALL COML HOUSE INDL 
1 2 3 4 5 6 7 8 9 
1.000 
0.853 1.000 


0.847 0.998 1.000 

0.873 0.995 0.988 1.000 

0.816 0.996 0.998 0.983 1.000 

0.945 0.970 0.969 0.973 0.953 1.000 

0.978 0.905 0.906 0.911 0.881 0.980 1.000 

0.940 0.972 0.966 0.982 0.951 0.996 0.968 1.000 

0.868 0.993 0.996 0.982 0.992 0.980 0.929 0.974 1.000 


WOONauRwWNH 


These correlations are only slightly lower then the contemparaneous 
correlations. Again commercial land in the six largest cities has by far the highest 
correlation. 


Chapter 10: The Chicken or the Egg 


251 


Land and Stock Prices in Japan 57 


Lagging forward yields the current NSA and the six month future land index 
values in Table 7. 


Table 7: Correlations between current NSA values and future land values 


a ic ices j est Cities 
Current Six Months Later Six Months Later 
NSA ALL COML HOUSE INDL ALL COML HOUSE INDL 
1 2 3 4 5 6 7 8 9 
1 1.000 
2 0.863 1.000 
3 0.868 0.998 1.000 
4 0.875 0.996 0.989 1.000 
5 0.824 0.996 0.996 0.984 1.000 
6 0.976 0.946 0.951 0.948 0.923 1.000 
7 0.993 0.860 0.871 0.864 0.827 0.978 1.000 
8 0.966 0.960 0.960 0.968 0.935 0.995 0.960 1.000 
9 0.915 0.984 0.990 0.975 0.977 0.979 0.923 0.978 1.000 


These correlations are higher than the lag behinds and the contemparaneous 
and suggest that the NSA's current value predicts land values in the future better 
than the reverse. The correlation with commercial land six months later in the 
six largest cities is an astounding 0.993, or an R2 of 0.986 for a regression fit. Hence 
the level of the NSA explains nearly 99% of the biannual variation of the level of 
commercial land prices in Japan's six largest cities in the past 34 years! 

Table 8 has the correlations with the NSA and commerical and in the six 
largest cities with lag behind of six (-1) or twelve months (-2), and lag forward of 
six (+1), twelve (+2) or eighteen (+3) months, plus the contemparaneous values, 
again with current NSA. 


Table 8: Correlations between current NSA values and lagged behind commercial 
land values 


Current Commercial Land Indices in an's Six Largest Cities 
NSA -2 -1 Current +1 +2 +3 
1 2 3 4 5 6 vA 

1 1.000 

2 0.949 1.000 

3 0.978 0.996 1.000 

4 0.988 0.985 0.996 1.000 

5 0.993 0.964 0.982 0.994 1.000 

6 0.989 0.943 0.966 0.983 0.995 1.000 

7 0.985 0.926 0.950 0.971 0.987 0.996 1.000 


The best contemparaneous fit is with commercial land in the six largest cities 
as shown in Figure 7. The regression equation is (with t statistics in brackets): 
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NSAI =) -818.63 + 75.23 Coml Land, 
(-5.19) (50.70) 
with R2 = 0.975, adjusted R2 = 0.975, DW = 0.468, autocorrelation = 0.759, and SS = 
891.21. The t-statistic of over 50 indicates the high degree of confidence that this 
variable's movements relate closely to those of the NSA. However, the Durbin- 
Wtson statistic is asymptotically equal to 2(1-p) hence at 0.468 there is significant 
positive serial correlation. 


Figure 7: The NSA and Commercial Land Index Values 
in the Six Largest Cities, 1955-1988 


NSA 
In this an succeeding figures, a solid line represents 
30000 the actual NSA plotted at six month intervals 
and the dotted line the prediction equation 
also plotted at six month intervals. / 
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Using commerical land in the six largest cities to explain the level of the NSA 
leaves little room for earnings to explain much. Indeed, adding earnings six 
months ahead, which would be a proxy for current earnings estimates, results in 
an equation with a non-significant earnings variable (although it has the expected 
sign), a lower R? and only a slightly lower sum of squared errors. The equation 
is: 


NSA; = -1297.57 + 75.77 Coml Land, + 36.18 EPSt41, 
(-2.79) (34.60) (0.861) 
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with R2 = 0.971, adjusted R2 = 0.970, DW = 0.476, autocorrelation = 0.759 and SS = 
870.38. Again the low Durbin-Watson value indicates positive serial correlation 
of errors. 

The earnings per share variable is significant with the log model: 


log NSA; = 3.008 + 0.8867 log Coml Land; + 0.6010 log EPS. 
(10.60) (26.60) (4.58) 


However, the fit is not as good, with R2 = 0.950, adjusted R? = 0.948, DW = 0.172, 
autocorrelation - 0.913, and SS = 0.24293. 

Figure 8 shows how much poorer this fit is compared to Figure 7. 

The log model with commerical land in the six largest cities does not fit nearly 
as well as the linear model. This model is 

log NSA; = 4.189 + 0.9734 log Coml Landt, 
(33.2) (31.0) 

with R2 = 0.936, adjusted R? = 0.936, DW = 0.120, autocorrelation = 0.940, and SS = 
0.27737. 


Figure 8: The NSA and Commercial Land Index Values in the Six Largest Cities 
and Six Month Future Earnings, Log Model Predictions, 1955-1988 


1955 58 68 62 64 66 68 78 72 74 76 78 8ð 82 84 86 88 


Using commerical land, the wholesale price index is now investigated. The 
importance of the wholesale price index in Japan can be gleaned from Figure 9 
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which shows that there is price parity at 1988-89 values of the yen if one uses this 
index. This is a painful reminder to all consumers in Japan of the extraordinary 
series of markups before a product the marketplace. The yen at 125-140 is greatly 
overvalued on a price parity basis for retail items compared to the U.S. dollar. 
(See Balassa and Noland, 1988, and Ziemba and Schwartz, 1991a, for surveys of 
these calculations.) See the discussion on a wholesale basis the yen is not 
overvalued. This should be noted by those who advocate a lower dollar to "solve 
the trade deficit problem" with Japan. With a higher yen Japanese manufacturers 
simply modified their operations to make their dollar costs lower. Indeed, as the 
yen rose in the post-1985 period Japanese manufacturers moved in this direction 
strongly. It is now commonplace in Japan for companies to list the yen/dollar 
exchange rate at which they will still be profitable and the percent of capacity at 
which this would occur. Some companies have their breakeven as low as 70¥/$ 
with twenty percent capacity. See the discussion in Ziemba and Schwartz (1991a). 


Figure 9: Yen-Dollar Exchange Rate and Purchasing Power Parity 
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Source: Yamaichi Research Institute, based on "Needs" 
published by the Nihon Keizai Shimbun Inc. and other data.. 


The prevailing view is summed up by Akio Morita, the Chairman of Sony, "A 
stronger yen would not help one jot: they will pay more for the same Japanese 
products and we will buy up America even more cheaply.” For an econometric 
analysis of the low dollar scenario see Marris (1989, 1987). See also Bergsten 
(1985). Marris' 1985 "predictions" were remarkably accurate during the fall 1985 to 
end-1987 period, but the recession he forecast did not come true. From 1988 to 
1990 the yen/$ rate has been in a trading range of 120-160. Marris and Bergsten 
among others are still advocating a weak dollar to help resolve the trade deficit. 
That may help. But I take the view, held my many Japanese, that the 


Chapter 10: The Chicken or the Egg 


255 


Land and Stock Prices in Japan 61 


econometrics simply will not work well as the structural characteristics of the 
Japanese productive economy are changing. What the U.S. needs more than a 
low dollar are better products that U.S. and foreigners want to buy, more savings 
and less consumption induced by higher taxes. 

Using the wholesale price index and commerical land in the six largest cities 
one obtains the following equation for the level of the NSA with both coefficients 
highly significant and an R2 of 98%: 


NSA, = -657.18 - 51.37 WPI, + 74.75 Coml Landt 
(-4.40) (-3.84) (55.20) 
R? = 0.980, adjusted R2? = 0.979, DW = 0.592, autocorrelation - 0.694, and SS = 
809.57. This is displayed in Figure 10. 


Figure 10: The NSA and Commercial Land Index Values 
in the Six Largest Cities and Wholesale Price Indices, 1955-1988 
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One obtains a slightly better equation, shown in Figure 11 and a near perfect fit 
from 1985-1988 when there was a steep increase in both land and stock values, by 
using commerical land lagged six months ahead. One then has 
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NSAt = -453.85 - 24.30 WPI + 65.94 Comml Landt41 
(-3.91)  (-2.28) (69.8) 
with R? = 0.987, adjusted R? = 0.987, DW = 0.339, autocorrelation = 0.824 and SS = 
642.14. 
Although not needed to predict the NSA, earnings per share does have the 
right coefficient sign and the coefficient is significant in both of the latter 
equations. For example, with contemparaneous commerical land 


NSA, = -1581.13 - 62.59 WPI + 72.17 Comml Land; + 89.58 EPS; 
(-3.71)  (-4.53) (41.9) (2.30) 


with R2 = 0.9815, adjusted R2 = 0.9806, DW = 0.683, autcorrelation = 0.650 and SS = 
783.63. 


Figure 11: The NSA and Commercial Land Index Values in the Six Largest Cities , 
Lagged Six Months Forward Wholesale Prices and Lagged NSA Indices, 1955-1988 
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Adding lagged NSA index values to the commerical land and the wholesale 
price index gives an R? above 99%. Leading or lagging or contemparaneous 
commercial land makes little difference in the R? fit but the coefficients change 
drastically. The commercial land six months ahead has the best fit, with good 
Durbin Watson statistics, as shown in Figure 12. 
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The three equations are 


NSA; = -287.08 - 26.40 WPI, + 23.85 Coml Landų1 + 0.7318 NSAqt 
(-3.56)  (-4.03) (5.18) (9.56) 


R? = 0.9942, adjusted R? = 0.9940, DW = 1.51, autocorrelation = 0.212, and SS = 
392.02. 


NSA; = -86.54 - 29.97 WPI + 3.64 Coml Land, + 1.0567 NSAt-1 
(-0.817) (-3.64) (0.548) (10.8) 


R2 = 0.9930, adjusted R2 = 0.9927, DW = 1.53, autocorrelation = 0.209, and SS = 
483.00. 


NSA = -46.78 - 31.15 WPI + 0.3425 Comml Landt-1 + 1.1043 NSAt-1 
(-0.429) (-3.53) (0.0575) (14.4) 


R2 = 0.9932, adjusted R2 = 0.9928, DW = 1.61, autocorrelation = 0.175, and SS = 
477.09. 


Figure 12: The NSA and Commercial Land Index Valuesin the Six Largest Cities, 
Lagged Six Months Forward, Wholesale Prices, and Lagged NSA 1955-1988 
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It is not very satisfying to use lagged NSA to predict current NSA. Except 
when using the future commercial land it takes over the equation. Indeed with 
lagged behind or contemporaneous commercial land, even that variable becomes 
insignificant. 
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We have the strong conclusion that the NSA stock index value is intimitely 
tied to the values of land in Japan, particularly commerical land in the six largest 
cities, and where these prices are headed in the next six months. Hence if the 
commonly discussed crash that many westerners see as inevitable happens, it will 
likely be tied to a simultaneous crash in land values. However, in 1990 there was 
a sharp drop in stock prices while land prices increased. 

Using bi-yearly data and price levels rather than price changes makes the 
analysis simpler and the predictions more accurate. This approach can be severely 
criticized since one is in effect using time as an explanatory variable. Nelson and 
Kang (1984) discuss this. This bias leads to lower variances and hence greater than 
true significance. However, the results are so strong that they seem to stand up 
well. Our purpose is modest, to simply ascertain whether or not land price levels 
were instrinically related to stock price level and which economic indicator seems 
to lead the other. The results seem to clearly indicate this dependence and the 
stocks leading land causality seems clear. But one realizes that the prediction of 
the changes even over six month periods is not that good by comparing Table 2 
and Table 7 in the Ziemba and Schwartz paper in this volume. While there were 
twenty-two stock price declines of 10% or more there was only one land price 
decline of about 5% in 1975-77, the second oil crisis period. Roll (1988) discusses 
the ability to predict stock price changes on daily or monthly basis using CAPM 
and APT models. The fits measured by R2 are in the 0.20 and 0.35 range for daily 
and monthly data, respectively and higher for quarterly returns. Stone and 
Ziemba (1990) have investigated such price change correlations for land and stock 
prices using quarterly data. They found that stock price changes do lead land price 
changes using Granger-Sims causality tests. The P-values for the hypothesis that 
Topix does not lead all land is only 0.0000149 versus 0.15872 for the reverse 
hypothesis that all land leads the Topix. Canaway (1990) has estimated that the 
Topix lead land by eleven months during the period May 1985 to April 1989. 

Stone and Ziemba concluded that: (1) stock prices are much more volatile than 
land prices; (2) despite their high level, the main increase in land prices was 
before 1971; the 1986-88 rise was high but the earlier rises were cumulatively 
much larger; (3) in late 1990 land prices were not falling but speculative land 
investment in condos and golf course membership, etc. fell sharply since March 
1990. These investments may be highly related to small stocks; (4) golf course 
membership price changes do not usually lead the stock market but they did for 
the 1987 crash; (5) golf course membership price changes do lead land price 
changes; (6) the golf course membership prices increased more rapidly than stock 
prices which in turn outpaced land price increases. As of March 1991 there was a 
considerable gap between Tokyo golf and the overall land market; see Figure 14. 
In the past the previous gaps particularly in 1987, have been quickly closed; and 
(7) despite a great desire of the government to cool down the land market and to 
try to engineer a 20-30% fall, extreme demand cash flows, cultural aspects and 
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unwillingness of the Diet to pass any serious tax laws that would probably 
generate such a fall, a great crash is unlikely. 


Figure 13 
The NSA Index versus the Nikkei Golf Membership Index, 1981:4-1991:1 
1000 
900 A, 
800 \ 
Golf led 
700 Oct 87 fall Na k 


81:4 82:4 83:4 84:4 85:4 86:4 87:4 88:4 89:4 90:4 


Source: Stone and Ziemba (1990) using data from th Nihon Keizai Shimbun, Inc. 


Stone and Ziemba (1990) found that the dynamic regression model shown in 
Table 9 predicts land price data using past stock price returns. 
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Table 9 
The Dynamic Regression Model to Predict Land Price Changes Using Past Stock 
Price Returns 


Variable Coefficient Standard Error t-Statistic Significance 
Constant 0.9689 0.4041 2.3980 0.0160 
Topix (-4) 0.0128 0.0072 1.7880 0.0740 
Topix (-3) 0.0355 0.0108 3.2910 0.0010 
Topix (-2) 0.0398 0.0108 3.6650 0.0001 
Topix (-1) 0.0161 0.0070 2.2920 0.0220 
Auto (-1) 1.6509 0.0833 19.7970 0.0001 
Auto (-2) -0.8001 0.0832 -9.6160 0.0001 
R2 0.954 

Adjusted R2 0.949 

Akaike Criterion (AIC) 0.464 

Schwarz Criterion (BIC) 0.526 

Durbin Watson 1.605 

RMS Error 0.411 


Source: Stone and Ziemba (1990) 


Figure 14 shows the results of the model in the sample estimation period, 
1972:2 to 1989:1, and in the forecast period, 1989:2 to 1990:3. 


Figure 14 

Prediction of Land Price Models In and Out of the Sample Estimation Period, 
Quarterly Data, 1972:3 to 1989:1 and a Forecast Period of 1989:2 to 1990:3 for the 
Dynamic Regression Model 
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Abstract: This paper investigates evidence on several seasonal regularities in the security price 
returns on the Tokyo Stock Exchange. The study uses data on the NSA and TOPIX market 
indices from 1949-88. Results are presented concerning monthly, turn-of-the-month and first- 
half-of-the-month, turn-of-the-year, holiday and golden week effects on the TSE. 


Keywords: Seasonal regularities, monthly effects, turn-of-the-month effects, turn-of-the-year 
effects, holiday effects, golden week effects, NSA market indices, TOPIX market indices. 


1. Introduction 


Research in English on Japanese anomalies is quite recent. Part of the 
reason for this is a lack of interest in such studies by the big Japanese 
brokerage firms who are unaware and suspicious of their potential use. These 
firms do not like to publish their research findings even in Japanese. Recently, 
however, U.S. and Japanese researchers have been studying these markets. The 
thrust has been mainly to ascertain the similarities and differences with the 
analogous results in U.S. markets. Kato, Schwartz and Ziemba (1989) have 
surveyed the research and presented new results on day-of-the-week effects in 
Japanese security markets. Other aspects of Japanese security markets are 
discussed in Elton and Gruber (1989), Ziemba and Schwartz (1991) and 
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Ziemba, Bailey and Hamao (1991). The U.S, literature on seasonal regularities 
is surveyed in Schwert (1983), Keim (1986), Jacobs and Levy (1988) and the 
books by Dimson (1988) and Ziemba (1992). 

To begin our study of seasonal regularities and to provide background for 
the reader, the paper begirs with a survey of recent work on the January, 
monthly and size effect literature which is based on data for subsets of the 
1949-88 periods for the first section of the (TSE). Since the first section of the 
TSE has about 86% of the trading values and volume, its study provides a 
good measure of the whole market. As of December 1988 the first section of 
the TSE had a capitalization of about ¥ 450 trillion which was more than the 
value of all the stock exchanges in the U.S. This survey is then complemented 
by evaluating the monthly effects over the whole sample period. New results 
on the other effects are discussed in succeeding sections of the paper. The 
basic efficient markets working hypothesis is that all days are equivalent and 
have the same mean returns. The seasonality results show that there are 
significant departures from this. An attempt is made to discuss why such 
departures occur for the various effects. The answers seem to be a combina- 
tion of cash flows, institutional and cultural factors and differences in risk. 


2. The January and monthly effects 


There seems to be a January effect in Japan similar to that in the U.S., 
Canada and many other countries. In Japan, the effect is not based on tax loss 
selling. For individuals, there were no taxes on capital gains or credits on 
losses during the period of this study. Corporations must pay taxes on capital 
gains but each firm can choose its own tax year and the majority are in March 
not at the turn of the year. 

In addition to excess gains in January particularly for small stocks, there 
are excess gains in June for small stocks. A major reason for these gains seems 
to be the large semi-annual bonuses paid by most Japanese companies in 
December and June. The precise dates vary but it is typical to pay these 
bonuses early in the month. For example, the big brokerage firms pay these 
bonuses near the beginning of the month. These bonuses can amount to as 
much as three months salary and they are steeped in tradition. They provide a 
great measure of flexibility to Japanese corporations facing economic swings. 
Employees purchase gifts - called Ochugen in the summer and Oseibo in the 
winter. They also invest some of their considerable savings in the stock market 
and this boosts the prices. Earnings forecasts may also be a factor. Corporate 
officers often make their earnings forecasts in May and financial analysts 
make theirs in March, June, September and December; see Kunimura (1984). 
Darrough and Harris (1991) and Ziemba (1990) show the strong effects of 
earnings on stock prices. Whether or not the timing of earnings forecasts 
injects a seasonality effect in the market is not well understood. There is, 
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however, a tendency to delay bad news and accelerate good news which may 
be a factor in the turn-of-the-month effect. 

Kato and Schallheim (1985) studied the monthly returns in Japan during 
the twenty-nine year period 1952 to 1980. The number of firms with clear data 
ranged from 529 in 1964 to 844 in 1980. Those firms that were delisted or had 
other data irregularities had similar monthly effects to those firms in the 
sample. Equally and value weighted indices were constructed from the Nissho 
Monthly Stock Price file (1952-80) and the Nikkei Needs Financial Data file 
(1964-81). They found that there was a small firm effect: the average monthly 
return for the equally weighted index (EWI) was 0.42% higher than the value 
weighted index (VWI). This yearly edge of (1.0042)'? — 1 = 5.16% is similar to 
that found in the U.S. in this period. The 5.16% is an overstatement of the 
actual buy and hold mean return difference due to the bias discussed in Roll 
(1983b). However, only the post 1964 period has the small firm effect in 
Japan. This is similar to the U.S. [see, e.g. Ziemba (1992) for precise results 
and references]. In Japan this also corresponds to the opening up of the 
Japanese economy to foreign investors. There is also a strong January seasonal 
effect in both the equally and value weighted indices of 7.08% and 4.48%, 
respectively, relative to the other months. The mean return difference was 
larger in the 1952-63 period, 8.27% and 5.99%, respectively, than in the 
1964-80 period, which gained 6.24% and 3.41%, for the EW! and VWI, 
respectively. From 1964 to 1980 Kato and Schallheim found the mean return 
differences in January returns to be size dependent. For the equally weighted 
index, the excess gain in January relative to the other months ranged from 
8.68% for the smallest decile firms and 3.18% for the largest decile firms with 
the other firms being 


Largest 2 3 4 5 6 7 8 9 Smallest 
3.18 3.82 4.42 6.25 6.12 7,16 8.15 8.20 8.55 8.68 


For the value weighted index the January versus the rest of the year mean 
return differences were 


Largest 2 3 4 5 6 7 8 9 Smallest 
— 0.98 —0.19 0.61 1.19 2.18 3.34 4.05 4.38 4.87 5.16 


Analogous to the U.S. Value Line-S&P500 spread gains in January, see 
Clark and Ziemba (1987) and Vander Cruyssen and Ziemba (1991), Kato and 
Schallheim computed the mean return difference between the equally and 
value weighted indices by portfolio size. This is not the best measure for the 
small versus large firm spread but one that gleans the effect. The spread in 
January excess returns relative to the CAPM model using equally and value 
weighted indices were 


Largest 2 3 4 5 6 7 8 9 Smallest 


EW — 2.48 —2.01 -144 -084 -052 053 108 140 1.82 2.02 
YW — 0.98 —0.19 0.61 1.49 2.18 3.34 405 4.38 4.87 5.16 
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Fig. 1. Mean daily returns by month in Percent for various small stock portfolios versus the 
TOPIX, 1975-88. 


Source: Yamaichi Research Institute. 


The monthly size effect is shown in fig. 1 using our updated data from 1975 to 
1988 using daily returns for the smallest 10, 25, 50 and 300 stocks and the 
TOPIX. Kato and Schallheim also found elements of June effect. So does the 
more recent data. Both January and June have significantly higher returns 
than the other months for small firms. Actual calculations appear in Kato 
(1990b) and Jaffe and Westerfield (1985a, b). 

Another study (unpublished and in Japanese) by Horimoto (1988) based on 
monthly return data on the equally weighted index is described in tables 1 and 
2 for all the stocks on the first section of the TSE from January 1965 to 
December 1987. Table 1 indicates that the smallest decile stocks gained 9.53% 
in January on average which is 7.21% more than the largest decile stocks. The 
smallest stocks also gained 4.34% more, on average, than the largest stocks in 
June. March and December are strong months relatively for the big stocks 
probably because of window dressing and corporate dividend and reporting 
effects. By month the highest average returns on the TSE were in January, 
followed in order by March, December and June. There seem to be two 
windows of opportunity for small stock buyers. The period January and 
February has a higher mean return of about 8.6% and the April to July period 
has one of about 8.2%. Small stocks are not in favor by the big four securities 
companies hence they have not encouraged their clients to bid up these prices 
to close these rather large differences in small versus large firm average 
returns. The data also show clearly the observation that September and 
October are typically weak months for all stocks. Risks in small stocks are 
higher than in big stocks as measured by the standard deviations shown in 


Mean returns in percent of ten size deciles on the first section of the TSE, by month 1965-87. 


Table 1 


Firm Jan. Feb. Mar. Apr. May June July Aug. Sep. Oct. Nov. Dec Mean Stdev St dev 

decile stocks index 

size 

Smallest 1 9.538 2.108 1.289 1.305 2536 5.308 2.759 1.543 -0.426 1.135 1.081 2625 2.567 6107 2.591 
2 7.704 2.379 2.064 0.622 1.818 4436 2.155 0859 —0.261 0.568 1.083 2.350 2.148 5.186 2.126 
3 6.993 1.796 1.908 0697 1.876 3.963 2.374 0.881 0.229 0.327 1.036 2.140 2.018 5.005 1.879 
4 6.303 1568 2.882 0.790 1.042 3.403 0.670 0.877 -90.270 0.545 0.831 2.102 1.729 4.843 1.774 
5 5.931 2.002 3.162 0.681 1.250 2.603 1.126 1.315 0.258 0.224 0.860 2.052 1.789 4811 1.586 
6 4866 1.938 3.194 0.795 1.350 2.390 0.816 1.289 -0001 -—0.365 1.255 2.098 1635 4.513 1.419 
7 4596 1.706 3.315 0.875 0.800 2.506 0.327 1.388 0.580 0.000 0.898 1.759 1.563 4.422 1.338 
8 3.232 1.373 3.527 0.934 0.619 1.454 0.232 1.457 0.830 —0.104 1.215 1.906 1.390 4.100 1.085 
9 2850 1.446 3.663 0.551 0.714 1.281 0.290 0.739 0.742 —0.156 1.579 2.567 1.356 4.351 1.141 

Largest 10 2.332 0.774 3.763 0.934 0.899 0.966 0.461 1.213 1.129 —0.430 1.295 3.142 1.373 4.808 1.163 

Mean 5.435 1.709 2.877 0.818 1.290 2.831 1.121 1.156 0.281 0.174 1.113 2.274 2.757 4.815 1.610 


Source: Adapted from Horimoto (1988). 
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Firm Jan. 

decile 

size 

Smallest 1 4.752 
2 3.747 
3 3.518 
4 2.835 
5 3.107 
6 2.530 
7 2.349 
8 2.478 
9 2.107 

Largest 10 3.777 

Mean 3.120 


Source: Adapted from Horimoto (1988). 


Feb. 


3.897 
3.604 
3.078 
2.516 
3.130 
3.405 
3.232 
3.275 
3.638 
3.896 
3.367 


Standard deviations of ten size deciles on the first section of the TSE, by month 1965-87. 


Mar. 


Apr. 


May 


June 


5.197 
4.399 
4.840 
4.002 
3.255 
3.272 
3.270 
2.753 
2.511 
2.779 
3.628 


Table 2 


July 


Aug. 


8.000 
6.829 
7.276 
7.470 
7.704 
6.838 
6.808 
6.073 
6.204 
5.862 
6.906 


Sep. 


Oct. 


6.648 
5.350 
5.642 
5.169 
5.080 
5.334 
4.835 
4.819 
4.751 
5.249 
5.288 


Nov. 


5.381 
5.546 
4.991 
5.332 
5.553 
5.276 
4.771 
3.677 
5.275 
5.860 
5.266 


Mean 


St dev 
of stock 


5.564 
4.780 
4.645 
4.447 
4.484 
4.238 
4.159 
3.933 
4.147 
4.683 
4.508 
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table 2. However, the risks in January and June are lower than in most other 
months and are below the mean monthly standard deviation. They are also not 
much higher than the average standard deviations by month for all ten of the 
deciles, Small firms in Japan in recent years have had low beta values. One can 
conjecture that the low betas are probably due to thin trading and the bias 
that this gives to beta estimates that are not computed using the Dimson or 
Scholes—Williams estimation methods. However, upon calculation these dif- 
ferences are minor. The small stocks simply do not move with the market 
indices in a regular pattern. For details, see Ziemba and Schwartz (1991). The 
advent of programmed trading once futures contracts were available on the 
SIMEX in 1986 and later in Japan has accentuated this disassociation of the 
small stocks from the NSA index. 

For the five-year period July 1983 to June 1988 the average beta for the 
NSA (relative to the TOPIX’s 1.00) estimated by the TSE was 0.84. For the 
largest capitalized stocks it was 1.16, for the medium caps 0.48 and for the 
smallest stocks only 0.37. For this study small firms were defined to be those 
with less than 60 million shares outstanding, medium sized firms with 60 to 
200 million shares and large firms more than 200 million shares. The TOPIX 
index is a value weighted average of all, some 1135, stocks on the first section 
of the TSE. The NSA is a price weighted average of 225 well known stocks. Its 
value is computed like the Dow Jones Industrial Average (DJIA) by adding up 
the values and dividing by the current divisor. More details on these indices 
appear in Ziemba and Schwartz (1991). These results are shown in fig. 2 using 


Mean return 
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Fig. 2. Mean returns by month and decile size for TSE stocks, 1985-87. 


Source: Constructed from data compiled by Horimoto (1988). 
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MRO = Smallest minus largest stock decile 


Fig. 3. Mean return differences by month and decile size for TSE stocks, 1985-87. 


Source: Constructed from data compiled by Horimoto (1988). 


capitalization deciles. Fig. 3 shows the monthly small firm minus large firm 
return differences. 

Kato (1990a) investigated the small-firm effect using daily returns on all 
TSE stocks, namely the value-weighted TOPIX index, from 1974, when there 
were 806 securities, to 1987 when there were 1069 securities. He used yearly 
rebalancing on portfolio size and ranked the securities into five size categories 
with 1 denoting the smallest capitalized stocks and 5 the largest. The average 
firm sizes for these groups were 773 million yen to 32,689 million yen, 
respectively. Table 3 describes the results. 

The small stocks outgain the big stocks by a two to one margin and the gain 


Table 3 
Mean returns for size related portfolios, TOPIX, 1974-1987. 

Firm size Average Mean retum 

market value 

ies Close to Close to Open to 

(¥ million) close open close 
Smallest 773 0.1492 0.1784 — 0.0295 
2 1,716 0.1058 0.1545 — 0.0489 
3 3,275 0.0859 0.1354 — 0.0495 
4 6,554 0.0764 0.0976 — 0.0213 
Largest 32,689 0.0653 0.0483 0.0168 


Source: Kato (1990a). 
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Table 4 
Average monthly returns on a portfolio containing all the stocks in the sample (from 373 to 566 
stocks) over the period January 1955~December 1985. 


Average monthly Sample Equally-weighted portfolio value-weighted portfolio 
PerUrR NEE Sze Average t-Statistic Average t-Statistic 
return return 

AH months 372 1.60% *** 6.40 1.84% *** 7.55 
All months 

but January 341 1.24% *** 4.75 1.61% *** 6.22 
January 31 5.58% *** 13.17 4.36% **™ 9.85 
February 31 1.00% 1.85 0.99% 1.73 
March 31 2.90% *** 3.77 3.85% *** 4.48 
April 31 0.51% 0.57 1.01% 1.23 
May 31 0.18% 0.20 0.86% 0.91 
June 31 2.51% ** 4.59 2.43% **™ 4.09 
July 31 0.65% 0.75 0.44% 0.51 
August 31 0.92% 0.85 1,05% 1.09 
September 31 0.13% 0.17 1.28% 1.85 
October 31 0.52% 0.48 0.49% 0.56 
November 31 1.81% 1.76 2.42% * 2.27 
December 31 2.48% ** 3.02 2.87% ** 3.25 


^ In this and succeeding tables * indicates that the average return is significantly different from 
zero at the 5 percent level, with a two-tail test, ** at the 1% level and *** at the 0.1% level. 
Source: Hawawini (1991). 


is monotonic in size. So the usual size effect was present in Japan during 
1974-87. Like in the U.S. the yearly size effect is not present during 1983-87. 
However, even in this period small firms do have much higher absolute and 
risk adjusted returns in January and June. Interestingly all the gains occur in 
the non-trading period at night as the close-to-open returns are statistically at 
least as high as the total, i.e., close-to-close returns. The returns during the day 
provide no gains at all and in fact are slightly negative except for the highest 
capitalized stocks. This finding is somewhat akin to Keim and Smirlock’s 
(1987) results for U.S. stocks who found that the major gain in the Value Line 
small stock index was mostly at night and there were larger gains during the 
day for the large cap S& P500 index. See Kato (1990a) for an analysis of these 
effects by day of the week in January and other months and Kato, Schwartz 
and Ziemba (1989) for some plausible reasons for this behavior. 

Additional insight has been provided by Hawawini (1988, 1991). His study 
used monthly data for the 31 years January 1955 to December 1985, The firms 
he considered trade on the first section of the TSE. There were 373 firms 
which traded continuously during this entire period and 566 which traded in 
the second half, January 1970—-December 1985. The effect across months 
appears in table 4. January has the highest returns for small stocks (the 
equally-weighted index) averaging 5.58% per month. The big stocks (the value 
weighted index) do nearly as well, returning 4.36% per month. June and 
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Table 5 
Risk and return characteristics of the largest and smallest size portfolios partitioned into four 
sub-portfolios ranked by decreasing magnitude of their beta coefficient risk TSE, January 
1955—December 1985. ° 
Average Sub-portfolios ranked by risk Average monthly return 
portfolio size Risk Size Allvear = January 
eee eae year January 
(in millions) (beta) (¥ millions) 
¥ 164,716 1.44 ¥ 158,275 1.29% 6.20% 
largest 0.89 ¥ 179,708 0.77% 3.33% 
quintile 0.57 ¥ 160,110 0.61% 2.74% 
0.21 ¥ 160,772 0.69% 1.14% 
¥ 4942 1.97 ¥ 5,406 2.02% 12.40% 
smallest 1.37 ¥ 4,776 1.34% 8.16% 
quintile 0.97 ¥ 4,710 1.35% 5.75% 
0.43 X 4,876 1.42% 2.51% 


* All beta coefficients and mean returns are significantly different from zero at the 5% level. 
Source: Hawawini (1991), 


December, the bonus months also have high returns in the 23% range, and 
March has high returns as well, namely 2.90% and 3.85% for the small and big 
stocks, respectively. 

The January effect and the excess gains of small stocks over big stocks are 
beta dependent. The small stocks have higher returns than the big stocks and 
the more so the higher the £ is. The smallest stocks with the highest 8s return 
12.40% in January versus only 1.14% for the largest stocks with the lowest Bs. 
Table 5 shows this effect which is analogous to that observed on U.S. stocks. 
See, Ritter and Chopra (1989). A strategy to exploit this is to invest in small 
stocks in January especially those with high betas and then if your transactions 
costs are small moving into big stocks for the rest of the year. For the period 
1975 to 1984 without transactions costs one has returns in U.S. dollars as 
listed in table 6. 

The strategy of being in small stocks in January and big stocks otherwise 
has lower risk than even big stocks throughout the year and nearly as high 
mean returns as being in small stocks all the time. Adding transactions costs 


Table 6 
Returns from various investment strategies on the TSE, January 1955 to December 1985. 


Strategy Arithmetic Geometric Standard deviation 


mean mean using arith. mean 
TOPIX 18% 17% 18% 
Bottom quintile on TSE 
1st Section 28% 22% 38% 
Bottom quintile in Jan. 
and TOPIX rest of year 22% 21% 14% 


Source: Hawawini (1991). 
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Table 7 
Mean daily return, standard deviation, and percent positive returns by month of the NSA, 
1949-1988. 
Month Sample Mean Standard t-Statistic p-Statistic % Positive 
size return, deviation returns 
% of return, 
% 

All days 11529 0.0482 *** 0.0933 5.56 0.0001 54.3 
January 885 0.1816 *** 0.0926 5.80 0.0001 60.7 
February 905 0.0549 0.0886 1.87 0.0622 56.7 
March 986 0.0457 0.0955 1.50 0.1335 55.0 
April 947 0.0623 * 0.0913 2.10 0.0359 54.2 
May 956 0.0074 0.0835 0.27 0.7843 53.1 
June 1011 0.0641 * 0,0852 2.39 0.0169 56.8 
July 1043 0.0083 0.1012 0.27 0.7904 52.3 
August 1038 0.0790 ** 0.0859 2.96 0.0032 55.2 
September 931 0.0059 0.0820 —0.22 0.8252 51.6 
October 998 0.0088 0.1150 0.24 0.8089 50.9 
November 910 0.0371 0.0879 1.27 0.2031 52.4 
December 919 0.0470 0.1038 1.37 0.1706 53.8 


Source: Yamaichi Research Institute. 


for the extra round trip transaction would reduce the difference in specific 
returns by 1-2% per year, but would not change the conclusion that the 
January small stock, TOPIX rest of year strategy dominates the TOPIX alone 
strategy. Additional research on these types of strategies appears in Ziemba 
and Schwartz (1991). 


3. The monthly effect on the NSA and TOPIX market indices, 1949-88 


Table 7 gives the monthly returns on the NSA‘ from 1949-1988 (Septem- 
ber). The data in this section are over longer periods than that discussed 
above, namely, the entire 1949-1988 sample period. January has by far the 
highest returns and all months are positive except for September which is just 
slightly negative. With January excluded one cannot reject the hypothesis of 
equal mean returns in all months. January has significantly higher returns than 
the other months. The market increased 60.7% of the days compared to 54.3% 
in all months (including January). The other months have similar behavior 
with September and October having the lowest returns. Even then more days 
rise than fall. 


! The NSA is a price weighted average of 225 stocks on the first section of the TSE. It is 


computed like the DJIA by simply adding up the values of the 225 stocks and dividing by the 
current divisor, which was 10.289 as of the end of December 1988. The NSA then had a beta of 
0,84 relative to the TOPIX index. 
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Table 8 
Mean daily return standard deviation, and percent positive returns by month of the NSA, 
1949-1959. 
Month Sample Mean Standard t-Statistic p-Statistic  % Positive 
size return, deviation returns 
% of return, 
% 
All days 3193 0.0565 ** 1.119 2.85 0.0044 53.6 
January 232 0.1990 * 1.302 2.33 0.0207 59.9 
February 239 0.0495 1.284 0.60 0.5519 57.2 
March 256 ~— 0.1284 1.290 -1.59 1128 47.7 
April 247 0.1446 * 1.068 2.13 0.0343 52.2 
May 262 0.0142 0.863 0.27 0.7904 50.7 
June 283 0.0192 0.952 0.34 0.7350 54.4 
July 291 0.0897 1.441 1.06 0.2895 54.4 
August 292 0.2482 *** 0.798 5.31 0.0001 59.6 
September 273 0.0330 0.813 0.67 0.5044 55.3 
October 292 0.0481 0.968 0.85 0.3965 52.1 
November 262 0.0031 0.939 0.05 0.9574 48.1 
December 264 — 0.0460 1.453 — 0.51 0.6077 52.3 


Source: Yamaichi Research Institute. 


To investigate the changing patterns, tables 8-11 consider the past four 
decades. The main differences are: 
(a) During the 1950s August had very high returns, surpassing even January, 
while December was slightly negative and March was significantly negative. 
Both January and August had gains on about 60% of the days, while March 
had losses on more days than gains. 


Table 9 
Mean daily return, standard deviation, and percent positive returns by month of the NSA, 
1960-1969. 
Month Sample Mean Standard = ¢-Statistic p-Statistic % Positive 
size return, deviation returns 
% of return, 
% 

All days 3000 0.0368 * 0.868 2.33 0.0201 52.8 
January 232 0.2170 *** 0.725 4.56 0.0001 62.9 
February 241 0.0271 0.673 0.62 0.5327 52.7 
March 258 0.0483 0.958 0.81 0.4187 51.6 
April 248 0.0465 0.770 0.95 0.3424 51.6 
May 246 — 0.0128 0.864 -0.23 0.8170 53.3 
June 257 0.0906 0.933 1.56 0.1211 56.8 
July 265 — 0.0592 0.920 —1.05 0.2957 51.3 
August 267 0.0139 0.783 0.29 0.7728 48.7 
September 246 — 0.0362 0.856 — 0.66 0.5079 47.6 
October 260 — 0.0135 0.101 — 0.22 0.8298 49.2 
November 240 0.0448 0.944 0.74 0.4631 52.9 
December 240 0.0965 0.868 1.72 0.0866 56.7 


Source: Yamaichi Research Institute. 
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Table 10 
Mean daily return, standard deviation, and percent positive retums by month of the NSA, 
1970-1979. 
Month Sample Mean Standard t-Statistic p-Statistic % Positive 
size retum, deviation returns 
% of return, 
% 

All days 2892 0.0391 * 0.853 2.46 0.014 55.3 
January 224 0.1370 * 0.830 2.47 0.014 59.4 
February 225 0.1075 0.830 1.94 0.053 59.1 
March 252 0.1158 ** 0.064 2.87 0.004 60.3 
April 239 ~ 0.0555 1.016 — 0.84 0.399 51.9 
May 240 0.0219 0.810 0.42 0.676 56.7 
June 251 0.0794 0.834 1.51 0.133 57.8 
July 258 0.0193 0.628 0.49 0.622 52.3 
August 259 — 0.0686 1.089 — 1.01 0.324 55.2 
September 232 0.0749 0.758 0.15 0.880 51.3 
October 249 — 0.0915 0.989 -0.15 0.884 49.8 
November 230 0.0335 0.812 0.63 0.532 54.3 
December 233 0.0977 0.898 1.66 0.098 55.4 


Source: Yamaichi Research Institute. 


(b) In the 1960’s January was very strong, having average returns of about 
0.21% per day and the market advanced about 63% of the time. After January, 
the market had a reasonable showing in the first half and then fell in the 
second half of the year but managed to close positively in December. The 


Table 11 
Mean daily return, standard deviation, and percent positive returns by month of the NSA, 
1980-1988. 
Month Sample Mean Standard i- p- % Positive 
size return, deviation Statistic Statistic returns 
% of return, 
% 

All days 2444 0.0625 *** 0.8242 3.75 0.000 56.1 
January 197 0.1658 *** 0.6825 3.41 0.001 60.4 
February 200 0.0361 0.5209 0.98 0.328 58.5 
March 220 0.1648 ** 0.7471 3.27 0.001 61.8 
April 213 0.1175 * 0.7212 2.38 0.018 62.4 
May 208 0.0060 0.8006 0.11 0.914 51.9 
June 220 0.0738 0.6054 1.81 0.072 58.6 
July 229 — 0.0292 0.7747 — 0.57 0.569 50.2 
August 220 0.1074 * 0.6612 2.41 0.017 57.3 
September 180 — 0.0406 0.8047 — 0.68 0.499 51.7 
October 197 0.0027 1.6574 0.02 0.982 52.8 
November 178 0.0814 0.7784 1.40 0.165 $5.6 
December 182 0.0516 0.5936 1.17 0.242 50.0 


Source: Yamaichi Research Institute. 
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Fig. 4a. Mean return in percent by month of the year, 1949-1988. (a) During January to March on 
the TOPIX Index. (b} During April to June on the TOPIX Index. (c) During January to March on 
the NSA Index. (d) During April to June on the NSA Index. 


Source: Yamaichi Research Institute. 
bonus month of June was positive and the market rose about 57% of the time 


then. The July-November period was very weak. 
(c) In the 1970s January again had the highest returns and again June and 
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Fig. 4 (continued). 


December had strong returns. The second half of the year, July-November 
was again weak with net losses in this period although the market did manage 
to close higher on more than half of the days. February and March were 
strong and had returns above December and nearly as high as January. The 
market was up about 60% in these months similar to January’s performance. 
(d) In the 1980s January again had the highest return but only marginally 
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Fig. 5a. Mean return in percent by month of the year, 1949-1988. (a) During July to September 
on the TOPIX Index. (b) During October to December on the TOPIX Index. (c) During July to 
September on the NSA Index. (d) During October to December on the NSA Index. 


Source: Yamaichi Research Institute. 


above March. July was weak again as was the September—October period. 
Other months, such as April, August, November and December, had strong 
returns. 
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Fig. 5 (continued). 


(e) The decades returned 0.056%, 0.037%, 0.039%, and 0.062%, respectively for 
an average gain over the 39 years of 0.048% per day. 


Figs. 4a—d and a-d provide the monthly returns year by year for the 
TOPIX and the NSA from 1949 to 1988. These results are more variable than 
the aggregated results but yield similar conclusions. 
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4. Turn-of-the-month effect 


In the United States the returns on trading days —1 to +4 of each month 
dominate the other days. This is referred to as the turn-of-the-month effect. 
The return in this period, coupled with that in the second week of the month, 
roughly trading days +5 to +8 or +10, essentially amounts to all the gains 
in the spot stock market in the period of the 1960s, 70s and 80s. See Ariel 
(1987) for spot data for the period 1963-82 and Sick and Ziemba (1991) for 
futures data for 1982-91. The rest of the months’ returns is essentially noise 
and at best provides zero returns. The reason(s) for this effect are not fully 
known but part of the story is likely that people receive their salaries on or 
around the —1 day and they receive their stock account statements so they 
have funds to invest in stocks. There are also portfolio balancing effects. When 
would one expect the turn-of-the-month to be in Japan? Most companies pay 
their salaries on the 25th of the month. Does the turn-of-the-month start then 
and is it similar to that in the U.S.? 

Figure 6 displays the daily returns over the 27 possible trading days each 
month in Japan for the NSA, 1949-1988. There are higher returns in Japan’s 
turn-of-the-month: — S5 to +2,. a seven-day trading period. Each of these days 
has returns of 0.10% or more on average and these returns are all statistically 
significant at the 5% level. The —1 day has very high returns, as in the U.S. 
The mean returns are over 0.22%, making its effect about as strong as a 
pre-holiday, namely providing returns about five times as large as on a typical 
trading day. The first-half-of-the-month effect, namely days —5 to +7, a 
twelve-day trading period, is also present. The fifteen-day period +8 to +17 


OMUZUNDA 
Mean return (%) 


Fig. 6. Mean rates of return in percent by trading day-of-the-month, NSA, 1949-1988. 


Source: Yamaichi Research Institute. 
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Table 12 
Turn-of-the-month data, 1949-1988. 
Trading Sample Mean Standard Maximum Minimum Median /-Statistic 
day size return, deviation return, return, return, 
% return, % % % 
% 

= 5 471 0.0899 ** 0.710 3.650 — 2.578 0.0831 2.75 
-4 471 0.1041 ** 0.853 5.321 —2.853 0.1197 2.65 
-3 471 0.1733 *** 0.984 4.118 — 4.216 0.2365 3.82 
-2 471 0.1334 *** 0.911 4.555 — 6.734 0.1392 3.18 
-1 471 0.2255 ** 0.914 2.750 — 8.686 0.3005 5.36 
1 471 0.0980 * 0.875 4.501 — 3.987 0.1563 2.43 
2 471 0.1006 * 0.884 3.673 — 4.849 0.1313 2.47 
3 471 0.307 0.924 5.633 — 4.368 0.0202 0.72 
4 471 0.0592 1.113 11.149 — 9,997 0.0437 1.15 
5 471 0.0358 0.957 6.307 — 10.649 0.0305 0.81 
6 471 — 0.0005 0.811 4.41 —3,.899  -0,0092 -0.01 
7 471 0.0357 0.838 3.471 — 4.600 0.0573 0.09 
8 471 — 0.0585 0.945 4.554 — 8.218 0.0112 — 1,34 
9 471 0.1065 * 0.940 6.877 — 4.716 0.1085 2.46 
10 471 0.0620 0.852 4.795 — 4.399 0.0570 1.58 
11 471 0.0395 0.813 5.394 — 4.025 0.0566 1.05 
12 471 — 0.0196 0.850 4.681 — 6.970 0.0531 — 0,50 
13 471 0.0115 1.055 11.289 — 7.680 0.0201 0.24 
14 471 — 0.0042 0.912 6.408 — 3,542 0.0218 - 0.10 
15 471 — 0.0306 1.237 6.138 — 14.901 0.0616 - 0.54 
16 471 0.0716 1.148 9.888 — 7.493 0.871 1.35 
17 471 — 0.498 0.888 4.716 — 4.253 —0.0380 -1.20 
18 429 — 0.0207 0.870 3.582 — 4,930 0.0206 — 0.49 
19 350 0.0162 0.841 3.924 -5.032 -0.0344 0.36 
20 229 — 0.0286 0.899 4.604 —4.705 —0.0104 — 0.48 
21 118 — 0.0476 0.354 3.600 — 6.610 0.0680 — 0.46 
22 39 0.0562 0.778 2.008 — 1.732 0.0079 0.45 


Trading days 
lst half 22-7 0.01142 
2nd half 8-21 0.00048 


Source: Yamaichi Research Institute. 


and —10 to —6 amounts to a second half of the month which has returns that 
are at best noise. In fact, these returns are negative, on average. The hypothe- 
sis that these returns are less than or equal to zero cannot be rejected at the 5% 
level. 

It is a difficult to compare the days consistently, as each month in Japan is 
lightly different. For example, Saturdays trade on the first, fourth and fifth (if 
there is one) week of the month. Table 12 shows the —5 to +2 and —5 to +7 
periods more clearly. These periods have the highest returns. The remainder of 
the month, starting either on day +8 or possibly on +12 has zero or negative 
returns. This is striking, since the average day during these 39 years had 
returns of nearly 0.05% per day. 

The numerical data of table 12 appears in fig. 6. If we let day +22 be in 
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our turn-of-the-month, then the 22, —5 to +7 period returns more than 1% a 
month, namely 1.142%, on average while in the +8 to +21 period (+8 to 
+17 and —10 to —6 in the other counting) the returns are slightly positive 
but they are insignificant from zero return of only 0.0482%, on average, which 
is only one in 23.7 times as much as in the turn of the month. The days of the 
week have a considerable effect on mean stock returns; see Kato, Schwartz 
and Ziemba (1989) and Kato (1990a). Hence the turn-of-the-month effect is 
commingled with the days-of-the-week effect. Ziemba and Schwartz (1991) 
present the results of a 52-variable model to separate out these effects to 
develop seasonality calendars to evaluate and rank all the days on a common 
basis. 

Discussions with experienced brokerage executives provided the following 
list of plausible reasons for the turn-of-the-month effect. They involve finan- 
cial flows or institutional arrangements that collectively seem to cause the 
effect. To test these hypotheses rigorously would require considerable statisti- 
cal ingenuity and data collection effort. 


- Most salaries are paid on days 20-25 of the month with the 25th being 
especially popular. This leads to buying pressure on the 25th. 

- There is portfolio window dressing on day —1. 

- Security firms can invest for their own accounts in amounts based on their 
capitalization. Since their capitalization usually rises each month and is 
computed at the end of the month, there is buying as early as on day —3 to 
account for this (because of the three day settlement process on the TSE). 

- Large brokerage firms have a sales push that lasts 7 to 10 days and this 
starts on day — 3. 

- Employment stock holding plans and mutual funds receive money in this 
period to invest, starting around day —3. 

~ People buy mutual funds with their pay which they receive on calendar days 
15 to 25 of the month; then these funds are invested in stocks with a lag, so 
most of the buying occurs on days —5 to +2. 


5. Holiday effects 


There is a holiday effect in Japanese securities similar to that in U.S. 
securities as documented by Ariel (1988). See also French and Roll (1986), 
Lakonishok and Smidt (1989), Zweig (1986) and Ziemba (1991). The mean 
return on all days on the NSA from 1949-1988 is 0.048% per day. On the 
pre-holidays the return is about five times as high: 0.246% per day. Moreover 
the risk as measured by the standard deviation is also lower: 0.794% versus 
0.979% per day, respectively. As in the U.S. there are no abnormal gains on 
the days around holidays except for the pre-holiday trading day. For example, 
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Table 13 

Holiday effects on the NSA, 1949-1988. 
Day Sample Minimum Maxi- Mean Standard t- 

size retum, % mum return, deviation, Statistic 

return, % % 
% 
Pre-holiday 408 — 5.4069 3.0631 0.2461 *** 0.7943 6.26 
Non pre-holiday 7143 —14,9009 11.2893 0.0489 *** 0.9786 4.22 
Days after 
holidays 408 — 8.6856 4.7122 0.0068 0.9893 0.14 


Pre-holidays, 
Fridays and Saturdays 
preceding market 


closing 2268 —6.6101 6.8765 0.1561 *** 0.7242 10.27 
Days after holidays 
and weekends 2268 — 8.6856 5.3938 — 0.0480 * 0.9473 -2.41 


Source; Yamaichi Research Institute. 


the days following holidays had returns of 0.0067% per day on the NSA during 

1949-1988 which is less than a typical day. A regression shows this: 

R = 0.0352 +0.0799 Day — 3 +0.0222 Day — 2 +0.1894 Day — 1 —0.0663 Day + 1 +0.00114 Day + 2 
(3.745) (1.491) (0.424) (3.709) (- 1.334) (0.023) 


where R is the daily return and the effects of trading days —3 to +2 are 
separated out using 0, 1 variables for these coefficients. Only the pre-holiday 
with a mean return of 0.1894% per day is statistically significant with a ¢-static 
of 3.709. 

If we lump Fridays without Saturday trading and Saturdays in as special 
pre-holidays then the average return over the thirty-nine years is still 0.156% 
per day and the trading day after special and regular holidays has negative 
returns namely —0.048%. This is mostly the effect of the negative Mondays. 
See Kato, Schwartz and Ziemba (1989) or Kato (1990a). In Japan, Tuesdays 
are negative, on average, but not when there is Saturday trading the previous 
week. Then Monday is strongly negative, on average. These results are 
summarized in table 13. 

Table 14 shows that pre-holidays improve every day of the week. Lakonishok 
and Smidt (1989) found similar results in their 90-year Dow Jones Industrial 
Average study. Even Mondays and Tuesdays are positive if they are pre-holi- 
days. Wednesdays, Thursdays, Friday and Saturdays have extremely high 
returns if they are pre-holidays. These days have positive returns about 70% of 
the time. Saturday trading has varied considerably over the years with the 
trend towards fewer and fewer open Saturdays. During the sample period, 
Saturdays were open about 90% of the weeks. Saturdays were closed for 
trading as of February 1989. For more on the day-of-the-week effects in 
Japan, see Kato (1990a), Kato, Schwartz and Ziemba (1989) and Amihud and 
Mendelson (1989). 
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Table 14 
Mean daily return, standard deviation and percent positive returns on pre-holidays by days of the 
week of the NSA, 1949-1988. * 


Sample Mean Standard t- P- Median % Positive 
size retum, % deviation, Statistic Statistic return returns 
% 
All Mondays 1937 —0.589 ** 0.9500 -—2.73 0.0064 0.0189 50.9 
All Tuesdays 1959 —0.0419 0.9586 -—1.93 0.0532 -0.0373 47.4 
All Wednesdays 1960 0.1164 *** 0.9722 5.30 0.0001 0.1212 56.9 
All Thursdays 1959 0.0871 *** 1.0234 3.77 0.0002 0.0676 55.1 
Alt Fridays 1959 0.0544 ** 0.9174 2.62 0.0087 0.0700 54.6 
All Saturdays 1755 0.1409 *** 0.7043 8.38 0.0001 0.1483 61.9 
Mon. before holidays 59 0.3106 ** 0.7698 3.10 0.0030 0.3788 76.3 
Tues. before holidays 59 0.0103 0.7866 0.10 0.9198 0.1192 57.6 
Weds. before holidays 58 0.1600 1.0388 1.17 0.2456 0.1955 69.0 
Thurs. before holidays 60 0.1868 ** 0.7860 2.82 0.0064 0.2986 73.3 
Fri. before holidays n 0.3821 *** 0.7944 4.05 0.0001 0.3411 74.6 
Sat. before holidays 101 0.2758 *** 0.6242 4.44 0.0001 0.2468 64.4 
1 256 0.2097 *** 0.7324 4.58 0.0001 0.1574 62.5 
2: 1703 0.0310 0.9401 1.36 0.1731 0.0522 53.4 
3 18 0.2648 0.8151 1.38 0.1859 0.4115 77.8 
4 56 0.1895 * 0.5595 2.53 0.0141 0.2139 66.1 


1 = Fridays before a no-trading Saturday; 2 = Fridays before a trading Saturday; 3 = Fridays 
before no-trading Saturday and no-trading Monday; 4 = Saturdays before no-trading Monday 
(Saturdays before a Monday holiday). 


Source: Yamaichi Research Institute. 


Table 15 shows the effect of the pre-holidays as separate from the day-of- 
the-week effects. The regression estimates show that the pre-holiday effect is 
the largest, about 0.246% per day. Mondays and Tuesdays are negative as 
expected. Since these estimates use the entire 39 years of the NSA, Wednes- 
days’ returns are slightly less than Saturdays’ and these two days return the 
bulk of the week’s return. All the coefficients are significant at the 5% level. 


Table 15 
Day of the week and pre-holiday effects on the NSA, 1949-1988. 
Effect Mean Standard t-Statistic p-Statistic 
return, % deviation, 
% 
Monday —0.0705 ** 0.0214 — 3.29 0.0010 
Tuesday ~ 0.0435 * 0.0213 — 2.04 0.0412 
Wednesday 0.1151 *** 0.0213 5.40 0.0001 
Thursday 0.0807 *** 0.0213 3.79 0.0002 
Friday 0.0420 * 0.0214 1.97 0.0492 
Saturday 0.1327 *** 0.0229 5.81 0.0001 
Pre-holiday 0.2461 *** 0.0460 5.35 0.0001 


Source: Yamaichi Research Institute. 
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Table 16 
Mean daily return, standard deviation on pre-holidays, NSA, 1949-1988. 
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Holiday Advances Mean Standard Minimum Maximum (Statistic 
return, % deviation, return, % return, % 
% 
January 1 32/39 0.2457 0.7625 — 3.38 1.269 2.01 
January 15 32/39 0.4711 ** 0.7157 — 2.04 1.947 4.11 
February 11 15/22 0.3011 * 0.5326 — 0.42 1.810 2.65 
February 20 21/39 0.0687 0.6213 — 1,50 1.912 0.69 
April 29 25/39 0.2272 0.8313 — 2,96 2.731 171 
May 3 31/39 0.4989 *** 0.6700 —0.74 3.063 4.65 
May 5 30/39 0.3884 *** 0.6184 — 0.66 2.188 3.92 
September 15 14/22 0.0154 0.7309 — 2.48 0.670 0.10 
September 23 27/39 0.2341 ** 0.4866 —0.92 1.310 3.00 
October 10 12/22 — 0.2487 1.4389 — 5.41 1.256 ~0.81 
November 3 24/39 0.2987 * 0.8684 -1.27 2.770 2.15 
November 23 25/39 0.2414 0.9711 —2,18 2.585 1.55 
Source: Yamaichi Research Institute. 
Table 17 
Mean return per day around the time of the golden week NSA, 1949-1988. 
Date Sample Mean t-Statistic p-Statistic 
size return, % 
April 20 32 — 0.1285 —0.73 0.4691 
April 21 32 — 0.0234 —0.18 0.8617 
April 22 33 0.0576 0.34 0.6963 
April 23 34 — 0.0024 —0.01 0.9888 
April 24 33 0.1133 1.23 0.2291 
April 25 34 0.1339 0.92 0.3664 
April 26 33 0.2922 * 2.10 0.0433 
April 27 33 0.1277 0.74 0.4639 
April 28 34 0.2148 1.41 0.1685 
April 30 31 — 0.1418 — 0.47 0.6448 
May 01 33 0.4312 * 2.69 0.0113 
May 02 34 0.5359 *** 4.47 0.0001 
May 04 30 0.3255 ** 2.96 0.0060 
May 06 31 — 0.0976 — 0.60 0.5516 
May 07 34 0.0903 0.87 0.3892 
May 08 33 — 0.1326 -0.79 0.4260 
May 09 33 — 0.1174 —0.91 0.3671 
May 10 32 — 0.0350 -0.23 0.8170 
May 11 32 0.0282 0.28 0.7924 
May 12 33 — 0.0701 — 0.70 0.4905 
May 13 33 — 0.1336 — 1.02 0.3134 
May 14 33 — 0.2421 -1.75 0.0889 
May 15 31 — 0.2111 -1.70 0.0989 
May 16 32 — 0.0635 —0.50 0.6235 
May 17 32 — 0.2209 -1.39 0.1755 
May 18 33 — 0.3210 * — 2.20 0.0355 
May 19 34 — 0.2898 ~1.47 0.1509 
May 20 32 0.0081 0.06 0.9551 
May 21 32 0.3553 1.57 0.1274 
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Table 18 
Rates of return for the 39 turn-of-the years, NSA, 1949-1988, 
Trading Mean Standard t-Statistic p-Statistic 
day return, % deviation, 
% 
-15 0.0292 0.7566 0.24 0.8108 
-14 — 0,603 0.9834 —0.38 0.7037 
-13 — 0.2631 1.2660 -1.29 0.2022 
-12 0.2210 1.9265 0.72 0.4782 
-11 —0.1036 0.7214 — 0.90 0.3754 
-10 0.0202 1.2598 0.09 0.9209 
-9 —0.1197 1.3817 —0.54 0.5918 
-8 0.0347 1.2604 0.17 0.8643 
-7 0.1413 1.1679 0.76 0.4538 
~6 0.1738 0.8303 1.31 0.1990 
-5 0.1303 0.7532 1.08 0.2868 
-4 0.4683 *** 0.6526 4.48 0.0001 
-3 0.4413 * 1.1038 2.49 0.0170 
-2 0.0613 0.9883 0.39 0.7005 
-1 0.2457 0.7625 2.01 0.0513 
1 0.0369 1.0594 0.22 0.8289 
2 0.1539 1.0529 0.91 0.3670 
3 0.3812 1.2409 1.92 0.0626 
4 0.2460 1.0516 1.46 0.1523 
5 0.4067 * 0.9277 2.74 0.0093 
6 0.0856 0.7856 0.68 0.5005 
7 0.2546 1.0541 1.51 0.1397 
8 0.0985 0.8342 0.74 0.4655 
9 0.4007 ** 0.8602 2.91 0.0060 
10 0.2736 * 0.6824 2.50 0.0167 
11 0.2243 0.7166 1.95 0.0580 
12 0.0375 0.7256 0.32 0.7489 
13 0.1101 0.7036 0.98 0.3309 
14 0.2180 * 0.6674 2.04 0.0484 
15 ~ 0.0334 1.4171 —0.14 0.8837 


The standard deviations are coincidently extremely close for all days of the 
week except for the pre-holidays. 

Are some pre-holidays better than others? Table 16 investigates this for the 
twelve holidays. Most of the pre-holidays have high returns. The exceptions 
are September 15 which is just barely positive and October 10 which is 
negative. This is not surprising as the earlier research showed very poor 
returns in these months over the years. 


6. The golden week effect 


In late April and early May there are three holidays in a one-week period 
referred to as The Golden Week. The holidays are on April 29, May 3 and May 
5. Hence, one would expect that April 28, May 2 and May 4 would have 
strong returns. Also, in relation to the Christmas holiday period, as Lakonishok 
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and Smidt (1989) found for the DJIA in the U.S., other days in this period 
likely will have large returns as well. The results for the NSA 1949-1988 data 
displayed in table 17 shows that the three pre-holidays have high returns. April 
28 returns, on average, 0.215%, May 2 a whopping 0.536% and May 4 0.326%. 
The in-between day, May 1, a +1 day, and a —2 pre-holiday, returns an 
extremely high 0.431%. The post-holiday day is poor; not surprisingly it 
returns a negative 0.142%. It is an exception to the observed fact that the —1 
day of the month is extremely positive. A change in 1989 may affect this. This 
is the closing of the market on May 4 to make a three-day break. With only 
two instead of three pre-holidays the total return in this period may be 
sufficient for —1 to be strong as usual. This was the case in 1989 and 1990. 


7. Turn-of-the-year effect 


Based on our discussion of the turn-of-the-month effect, one would expect 
the turn-of-the-year effect to begin around day —5 and run to the middle of 
January. The data shown in table 18 indicate that the effect seems to start on 
day —7 when the mean return is 0.142% per day. The effect then has positive 
returns on every trading day until +14. Many of the days have phenomenal 
average returns: —4 is 0.468%, —3 is 0.441%, —1 is 0.246%, +3 is 0.381%, 
+5 is 0.407%, +7 is 0.255%, +9 is 0.401%, +10 is 0.274%, +11 is 0.224%, 
and +14 is 0.218%. This is with the 39 years of NSA data from 1949-1988. 
There are no small-firm indices readily available in Japan. Given the results 
discussed earlier that the small firms would probably have even higher returns. 
The cumulative average return on these 21 trading days is about 3% for the 
NSA. 

The —1 day, the final trading day of the year, is the start of the U.S. 
turn-of-the-month effect, how important is it in Japan? Its return, by itself, is 
about 0.198% per day, or more than four times the average day’s return of 
about 0.0476% as shown by the regression model: 


Mean return % t-Statistic p-Statistic 
bo Typical day 0.0476 *** 5.47 0.0001 
bi —I's 0.1981 1.32 0.1856 


where 


l 1 for — 1 day of the month 
b= ‘ 
0 otherwise 


The following regression model separates out the —1 of the year effect, 
from the —1 of each other month and the pre-holiday effect. The results show 
that each of these three effects is quite similar in size, returning about 0.2% per 
day. At least one third of the total mean returns amounts to ~—1’s and 
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pre-holidays which is similar to what Lakonishok and Smidt (1989) found for 
the Dow Jones Industrial Average over the 90 years from 1897 to 1986. 


Mean return t-Statistic p-Statistic 
bo Typical day 0.0341 *** 3.78 0.0002 
bi —1 of the year 0.2116 1.42 0.1568 
by — 1 of the month 0.1822 *** 3.95 0.001 
by Pre-holidays 0.2121 *** 4.30 0.0001 


b = p for last trading day of December 
0 otherwise 


, 


and pre-holidays ' 


1 for-1 day of the month except for January 
bz 
0 otherwise 


| 1 pre-holidays except-1 day of year 
0 otherwise 


8. Conclusion 


The results show that the seasonality regularities in Japan during the 
1949-88 period were quite similar to the corresponding effects in U.S. security 
markets. Differences occur because of alternative institutional and cultural 
patterns in Japan. For example, the turn-of-the-month effect which indicates 
higher returns on trading days —1 to +4 in the U.S. is paralleled in Japan 
with days —5 to +2. The returns on these days were about two thirds of the 
total monthly return. The rest of the return in each country was earned on 
days +5 to +9 in the U.S. and +3 to +7 in Japan. For the balance of the 
month in each country, the returns are statistically Jess than or equal to zero. 
The turn-of-the-year effect was also similar to that in the U.S. except it was 
longer in December and in January. 

The holiday effect is also similar with strong gains on pre-holiday trading 
days and negative returns on the post-holiday trading days. The Golden Week 
effect in early May is unique to Japan. It is interconnected with the holiday 
effect since during the sample study period 1949-88 there were three holidays 
during Golden Week. The strongest days of the year in Japan as in the U.S. 
are the pre-holidays and the —1 days for all twelve months. These days have a 
mean return of about 0.20% per day. 

The small stock January effect in Japan was similar to that in the U.S. 
despite the fact that there were no capital gains for individuals during the 
period. Japan also has a strong June effect for small stocks. Large bonuses 
paid to workers just prior to these stock rises seem to be at least a part of the 
cause of the high returns in these months. 

Students and practitioners of the Japanese stock market can utilize the 
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seasonal regularities for several purposes. One is in the construction of 
seasonality calendars that rank the days of the years on a +4 (the best) to — 4 
(the worst). This is discussed in Ziemba and Schwartz (1991). The evidence is 
that such seasonality calendars do show violations of market efficiency with 
the high rated day having significantly higher returns than the low rated days 
at high levels of significance. The seasonal regularities along with the small 
stock effect can also be used to construct anomaly portfolios which is also 
discussed in Ziemba and Schwartz (1991). 
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Chapter 12 


SEASONALITY EFFECTS IN JAPANESE FUTURES MARKETS* 


William T. Ziemba 


Yamaichi Research Institute, Tokyo 
University of Tsukuba and 
University of British Columbia 


This paper investigates seasonal regularities in the security price 
returns on the first section of the Tokyo Stock Exchange. The research 
uses data from the futures markets in Singapore for the Nikkei Dow 
225 index and in Osaka for the Kabusaki 50 index. The questions of 
main concern are whether or not the seasonal anomalies observed in 
the spot markets are maintained, are they anticipated in the futures 
markets, and do the futures market anticipations alter the character of 
the seasonal regularity. Results are presented concerning day of the 
week, monthly, holiday, turn of the month and year and first half of 
the month effects. The conclusions are tentative because the futures 
markets have only a short history to date. 


1. INTRODUCTION 


The Japanese country and economy were devastated in WWII. After the war 
the country was occupied and plans were set for a rebuilding. The stock 
markets reopened in 1949. The early years were difficult but the hard work 
and internal savings for investment paid off as the country's products gained 
more and more acceptance abroad. The full fruits of the economic miracle of 
Japan are now well known and feared in the west. Since 1980 the transfer of 
wealth to Japan from the U.S and others has been very large indeed. Figure 1 
shows this through the world stock market capitalizations in October 1980 and 
September 1988. Europe's share of the world's stock markets has stayed about 
constant (21% in 1988 versus 20% in 1980), the U.S. share has dropped from 


*Without implicating them I would like to thank my colleagues at the Yamaichi Research 
Institute particularly A. Komatsu and H. Shintani for their help and useful discussions on 
anomalous behavior in Japanese security markets. Thanks are also due to Warren Bailey and 
Sandra Schwartz for helpful comments on an earlier draft of this paper. This research was 
conducted at the Yamaichi Research Institute with the advice of William T. Ziemba. All the 
rights to the research belong to the Yamaichi Research Institute. I thank the Yamaichi 
Research Institute for permission to publish the results. This research was also partially 
supported by the Social Sciences and Humanities Research Council of Canada, and the Centre 
for International Business Studies, University of British Columbia. 
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53% to 31% and the rest of the world's is now 6% compared to 12% in 1980. 
Meanwhile, Japan's share has increased from 15% to 42% Net assets of Japan 
were $11.5 billion at the end of 1980 and those of the U.S. were $106.3 billion. 
By 1987 Japan's net assets grew over 20 fold to $240.7 billion. Meanwhile U.S. 
private and public assets fell by a negative $368.2 billion. There is much talk in 
the press about the U.S. trade and budget deficits. Indeed the public's deficit 
was $148.9 billion at the end of 1987. But the private sector of the U.S. is also 
$219.3 billion in the red. 


Rest 


Japan 


Rest 


Europe 


Japan 
Europe 


U.S. 


October 1980 September 1988 


Figure 1: Stock Market Capitalizations 


There has been a huge increase in Japanese stock prices. A standard measure 
of the market is the Nikkei Dow index. This index is analogous to the Dow 
Jones Industrial Average in the U.S. Its value at any time is the sum of the 
prices of the 225 stocks in the index divided by the current divisor which is 
adjusted over time to account for stock splits, rights offerings, etc. The index 
stood at ¥176.21 on the date of its original investment, May 16, 1949. By the 
end of the year the index had fallen to ¥109.9. But by the end of December 1988 
the index had increased to ¥30,159. The divisor which began at 225 was then 
10.289. The increase over the 38+ years was 171.15 times not counting 
dividends and taxes. In U.S. dollars the increase was a remarkable 489.59 
times. The increase has not been straight up. Over this period there have 
been twenty corrections of 10% or more and nine of over 20%. For details see 
Ziemba and Schwartz (1990). Table 1 shows the closing prices of the ND index 
plus the value of the yen/dollar exchange rate year by year plus the gains in 
yen and dollars for the ND index. 


The major stock market in Japan is the Tokyo Stock Exchange. The TSE has 
over 85% of the value and trading volume of the whole market which is also 
traded in Osaka, Nagoya and five other smaller exchanges. The stocks in the 
Nikkei Dow comprise about 50% of the market value and about 75% of the 
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Table 1: Yearly Yen Closing Prices of the Nikkei Dow and 
Yen/Dollar Exchange Rates, 1949-88 
Current Value of 
1¥ invested 1$ invested 


End of Year ND Yen/$ in 1949 in¥ in 1949 in$ 
1949 109.9 360.00 0.62 0.62 
1950 101.9 360.00 0.58 0.58 
1951 166.1 360.00 0.94 0.94 
1952 362.6 360.00 2.06 2.06 
1953 377.9 360.00 2.14 2.14 
1954 356.1 360.00 2.02 2.02 
1955 425.7 360.00 2.42 2.42 
1956 549.1 360.00 3.12 3.12 
1957 474.5 360.00 2.69 2.69 
1958 666.5 360.00 3.78 3.78 
1959 874.9 360.00 4.97 4.97 
1960 1356.7 360.00 7.70 7.70 
1961 1432.6 360.00 8.13 8.13 
1962 1420.4 360.00 8.06 8.06 
1963 1225.1 360.00 6.95 6.95 
1964 1216.5 360.00 6.90 6.90 
1965 1417.8 360.00 8.05 8.05 
1966 1452.1 360.00 8.24 8.24 
1967 1283.5 360.00 7.28 7.28 
1968 1714.9 360.00 9.73 9.73 
1969 2359.0 360.00 13.39 13.39 
1970 1987.1 360.00 11.28 11.28 
1971 2713.7 314.80 15.40 17.61 
1972 5207.9 302.00 29.56 35.23 
1973 4306.8 280.00 24.44 31.42 
1974 3817.2 300.95 21.66 25.91 
1975 4358.6 305.15 24.74 29.18 
1976 4990.8 292.80 28.32 34.82 
1977 4865.6 240.00 27.61 41.42 
1978 6001.8 194.60 34.06 63.01 
1979 6569.5 239.70 37.28 55.99 
1980 7116.4 203.00 40.39 71.62 
1981 7681.8 219.90 43.59 71.37 
1982 8016.7 235.00 45.50 69.69 
1983 9893.8 232.20 56.15 87.05 
1984 11542.6 251.10 65.50 93.91 
1985 13113.3 200.50 74.42 133.62 
1986 18701.3 160.05 106.13 238.72 
1987 21564.0 123.00 122.38 358.18 
1988 30159.0 125.85 171.15 489.59 

April 19. 1989 33185.15 132.36 188.32 512.22 


trading volume on the first section of the TSE. The second section has only a 
few percent of the value and volume so the first section and the Nikkei Dow 
give onea good indication of the whole market. The first section's 
capitalization is currently about Y490 trillion which is more than all the stock 
exchanges in the U.S. The ND index suffers from the same problems as the 
DJIA. A more representative index of the whole market is the TOPIX which is 
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a value weighted index of all, some 1135, stocks on the first section. It is 
analogus to the NYFE index in the U.S. Relative to the TOPIX, the ND has a 
beta (using data from Octover 1983 to October 1988) of about 0.84 so it is less 
variable than the overall market. The TSE is highly regulated with detailed 
and cumbersome listing procedures, price and margin limits and the like. 
Trading is dominated by the institutions including the life insurance 
companies and made largely through the big four: Nomura, Nikko, Daiwa 
and Yamaichi. Although there are some 45 foreign brokerage firms in Japan 
they handle only about 2% of the trading volume. In contrast, Nomura alone 
has about 30% of the trading volume. Although the foreign firms are very 
active in program trading and the like the lack of commission business has led 
to massive losses. Only ten of the 45 had profits in 1988 and the profits of 
these were small compared to those of the big four.! 


The Japanese stock market is not well understood and is therefore avoided in 
the U.S. Despite sharply rising prices and a firming yen, there was net selling 
by foreigners from 1984 to mid 1988. Fears of extraordinary high PE ratios, 
land prices beyond belief and the like abound as has been described in Ziemba 
and Schwartz (1990). A few key points are: 


¢ there are extraordinary high correlations between land prices particularly of 
commercial land and stock prices, see also Ziemba (1989b) 


+ much of the stock holdings are to cement business relationships and are 
never traded 


e there is a great preference by the Japanese corporations and individuals to 
accumulate land and a strong feeling not to sell it ever 


e the value of the stocks is much less, even with their high PE ratios based 
on trailing earnings of some 50-70 times, than the value of the land and 
stock holdings they own not to mention the value of the assets in 
businesses they are in. Tobin's Q, the percent of the assets that the stocks 
are worth is about 70% for the ND. 


> the PE ratios are not generally as useful a measure of stock value as in the 
U.S. because of the nontrading, undervaluation of land and other 
buildings, stock holding and other assets. Still there is a small but 
significant low PE effect separate from other valuation descriptors 


The impending crash that many U.S. and other foreign observers see as 
inevitable may take a while to materialize if at all. The confidence level of the 
Japanese in their markets is high and for good reason. Sharply rising earnings 
growth, GNP growth of about 6% in 1988, 4.5% in 1989 (estimated), low 


Japanese firms are not doing well in the U.S either. 
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interest and inflation rates, a strong yen, huge balance of trade and the like are 
cited as reasons (and used as explanatory variables in econometric prediction 
equations) for this strength. Japanese investors prefer to buy on weakness 
rather than on strength. That is the reverse of the majority of U.S. investors. 


The stock market is opening up quickly with a whole host of derivative 
instruments recently added and planned. There are now four index futures 
contracts trading and options on futures indices are scheduled to be traded 
later in 1989. Details on these contracts and their early histories appear in 
Ziemba and Schwartz (1990). 


There is not much literature yet on Japanese anomalies.3 Part of the reason for 
this is a lack of interest in such studies by the big Japanese brokerage firms who 
are unaware and suspicious of their potential use. Also these firms do not 
generally publish much of their research findings even in Japanese. Recently, 
however, U.S. and Japanese researchers have been studying these markets. 
The thrust has been mainly to ascertain the similarities and differences with 
the analogous results in U.S. markets. Some of the main references are 
Hawawani (1988a,b), Jaffe and Westerfield (1985a,b), Kato (1988a,b), Kato and 
Schallheim (1985), Kunimura (1984), Ikeda (1985, 1988) and Nakumura and 
Terada (1984). Kato, Schwartz, and Ziemba (1989) have surveyed the research 
and presented new results on day of the week effects in Japanese security 
markets. Other research works on the Japanese stockmarket are Bailey (1989), 
Brenner, Subrahmanyam and Uno (1987, 1988), Elton and Gruber (1988, 1989), 
Hamao (1988, 1989), Kunimura (1984), Hiraki, Aggarwal and Rao (1988), 
Komatsu and Ziemba (1989), Lau, Quay and Ramsey (1974), Roehl (1985), 
Pettway and Tapley (1984), Suzuki (1988) and Schoenfeld (1988). 


Ziemba (1989a) had access to data on the entire 38+ years of the ND and TOPIX 
indices and used it to study holiday, monthly, turn of the month and year, first 
half of the month and Golden Week effects on the first section of the TSE. 
There are strong and pervasive security market regularities that with some 
changes because of the institutions involved are analogous to many U.S. find- 
ings. The purpose of this paper is to investigate how the new index futures 
markets are affecting these anomalies. The specific data used are from two of 
these futures contracts: the SIMEX, ND225 futures traded on the Singapore 
International Monetary Exchange and the Kabusaki 50 index traded on the 
Osaka Stock Exchange. These contracts began trading on September 3, 1986 and 
June 6, 1987, respectively. The newest futures contracts, based on the TOPIX 
traded on the TSE, and the ND traded on the OSE began trading on September 


2For a survey of the U.S. derivative instruments and some of the theory behind them see 
Rubinstein (1987). 
3The U.S. literature on seasonal regularities is now extremely voluminous. Surveys and list of 


references appear in Schwert (1983), Keim (1986), Jacobs and Levy (1988) and the books by 
Dimson (1988) and Ziemba (1990). 
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3, 1988. In this paper we investigate the SIMEX, and Kabusaki data up to 
September 19 and 20, respectively, which is 557 trading days for the SIMEX and 
352 for the Kabusaki. 


Figure 2 shows the volume on the SIMEX and Kabusaki 50 during and after 
the period of this study. Similarly, Figure 3 shows the volume of the two new 
contracts. The main point is that the new contracts essentially replaced the 
Kabusaki 50 and its trading volume shrank to almost nothing and it has 
remained at this low level of activity. Meanwhile the SIMEX had an increase 
in volume part of which is increased programmed trading. The papers by 
Brenner, Subrahmanyam and Uno (1987, 1988) study arbitrage programmed 
trading strategies between the SIMEX and Kabusaki for the early part of the 
history before September 1988. See also Chapter 13 in Ziemba and Schwartz 
(1990). The Kabusaki is a price weighted package of 50 stocks traded on the TSE 
and OSE designed to track the ND. It has a daily limit price move of 3% in 
either direction. Hence in a market crash, it does not trade. The SIMEX has no 
such price limits and had a huge fall on crash day October 20, 1987. It now has 
a daily price limit change of 15% 


Singapore time is one hour behind Tokyo time, hence to correspond to 
Tokyo's 9:00 a.m. opening, the SIMEX begins trading at 8 a.m. During the time 
of this study, the TSE was open for trading from 9:00-11:00 a.m. and 1:00-3:00 
p.m., and 9:00-11:00 a.m. on the first, fourth and fifth (if there is one) Saturdays 
of the month. As of the beginning of February 1989 there was no Saturday 
trading on the TSE with trading in the afternoon from 12:30-3:00 pm with 
extra trading on the previously closed days December 28 and 29. SIMEX does 
not close for lunch and trades continuously from 8 a.m. to 2:15, an extra fifteen 
minutes after Tokyo's close. This is much the same as the extra time futures 
on the S&P 500 and other indices trade following the close of the New York 
Stock Exchange. Most of the trading days on the TSE and SIMEX are 
comparable but occasionally one of them has a different holiday. In the initial 
period until May 23, 1987 SIMEX did not trade on Saturdays. 


The SIMEX has a spot contract for the ND that is rarely traded, plus four 
futures contracts that mature on the quarter months, as do the Kabusaki and 
U.S. futures contracts, of March, June, September and December. The contracts 
mature on the third Wednesday of the month and they are settled in cash 
based on the closing value of the ND. The Kabusaki is settled on the fifteenth 
day of each contract month with the delivery of the basket of stocks. 
Throughout most of its trading history the SIMEX traded at a discount from 
fair value and often the Kabusaki was at a premium. There simply was not 
enough arbitrage trading to keep the prices in line. I refer the reader to the 
cited references and also Schoenfeld (1988) who among other 
accomplishments was the first westerner to trade on the SIMEX. 
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Figure 3: Trading Volume of the ND and TOPIX Futures Contracts Traded on 


the OSE and TSE, September 3, 1988 to January 31, 1989. 
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The plan of the paper is to describe in turn each of the spot anomalies and 
then investigate their current effects in the futures and spot markets. The 
conclusions reached are tentative because these data sets are still small but 
they are suggestive. In general, the anomalies still seem to be there, although 
they are anticipated in the markets to some extent and this has shifted their 
effects somewhat. 


2. DAYS OF THE WEEK EFFECT 


A survey of the days of the week effect in the spot Japanese stock markets 
along with some new results investigating the ND and TOPIX indices for the 
entire 1949-88 period appears in Kato, Schwartz and Ziemba (1989). The major 
research papers investigating the day of the week effects using data from the 
1970s and 1980s are Ikeda (1985, 1988) in Japanese, Kato (1988a,b), Kato and 
Schallheim (1988), and Jaffe and Westerfield (1985a,b). Some of the main 
conclusions of this research are: 


° There is a strong day of the week effect. Wednesdays and Saturdays are 
strongly positive, Thursdays and Fridays are mildly positive, Mondays are 
about neutral, and Tuesdays are especially negative. Until the end of 
January 1989, Saturday trading occurs for a half day on the first, fourth, and 
(if applicable) the fifth week of the month. 


e For the period 1978-87, Wednesdays gain 0.145% and Saturdays 0.14% each 
about triple the average gain of 0.058%. This is counterbalanced by the 
nearly even Mondays which return 0.004% on average and a loss of 0.09% 
on Tuesdays. Thursdays and Fridays return 0.065% and 0.105%, 
respectively. 


° Except for Saturdays which open up and rise all day, on average, most of 
the gains occur at night. Except for a brief rise early in the day and a little 
kick at the end all the other days have negative returns during trading 
hours. 


e Saturday trading affects the market in significant ways. Indeed the market 
seems affected by whether or not Saturday trading occurs. For example, 
with Saturday trading on the first, fourth and fifth weeks of the month 
during the sample period, the following Mondays are positive. But 
without Saturday trading Mondays are negative. Tuesdays are always 
negative except in the third week of the month when there is no Saturday 
trading both in this week and the proceeding week. In this case Mondays 
are especially negative losing 0.35% on average while Tuesdays gain back 
0.19% of this. 


e In total, Monday-Tuesday trading losses in Japan are about 0.09% which is 
similar to the Monday losses coupled with Tuesday's mildly positive gain 
in New York for a Monday-Tuesday return of about -.11%. Correlations 
between the New York and Tokyo's Dow's vary by time period but these 
relationships are more or less becoming stronger over time. What is 
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consistent is that New York's effect on Tokyo is stronger than the reverse. 
For the period 1980-87 the correlation between the previous day's return in 
New York and the close-to-open returns in Tokyo was 53.6%. The reverse 
effect, the correlation between Tokyo's close-to-close and New York's close- 
to-open is only about 11%. 


Table 2 sumarizes the recent effects using 1978-87 data and the effects of 
Saturday trading. 


Table 2: Effects of Saturday Trading on the TOPIX, April 4, 1978-June 18, 1987 


Trading Ends 
This Last Sample 
Week Week Week Mon Tues Wed Thur Fri Sat Size 


Ist,5th A1 Sat Sat 0.0198 -0.1072 0.1790 0.0581 0.0605 0.1678 1406 
4th A2 Sat Fri -0.1008 -0.1102 0.1074 0.0306 0.1582 0.0793 649 


2nd B1 Fri Sat 0.1135 -0.0649 0.0980 0.0933 0.1341 539 

3rd B2 Fri Fri -0.3489 0.1931 0.2479 0.2609 0.2193 58 
All Weeks 0.0039 -0.0902 0.1449 0.0648 0.1049 0.1397 2652 

Sample Size 449 464 464 465 467 343 2652 


(Source: Kato 1988a) 


Pieptea and Prisman (1988) and others such as Cornell (1985) have found that 
in the U.S. markets Mondays fall is fully anticipated in the futures markets 
using the S&P500 index. The futures market tends to fall in the last hour or so 
of trading so that even as the spot market is rising the futures are moving 
lower to anticipate Monday's decline. For the period March 18, 1983 to June 
27, 1986 Pieptea and Prisman found the following results: 


Mon Tues Wed Thur Fri 
Mean Annualized Return 
___in Spot Market : -4.93 42.33 3.16 23.50 32.59 
iie Standard Deviation 211 245 241 197 223 


Mean Annualized Return 
in Futures Market of the 


i pera n 19.35 9.10 483 23.57 6.75 
Me eee ee 2.09 2.46 2.48 2.07 2.12 


Figure 4 and Table 3 compare the day of the week effects on the spot ND in 
Tokyo with the futures market on the Nikkei Dow at the SIMEX in Singapore. 
Similarly Figure 5 and Table 4 compare the effects on the Osaka 50 futures, the 
Osaka 50 spot and the Nikkei Dow spot. 
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Table 3: Days of the Week Effects in the ND Spot Market in Tokyo and the 
SIMEX Futures Markets in Singapore, Sept 3, 1986 to Sept 19, 1988 


SIMEX futures ND spot 

Sample Mean % Sample Mean % 
Day Size Return t-Statistic Positive Size Retum t-Statistic 
Positive 
All 532 0.094 100 56.8 557 0.082 1.50 56.6 
Mon 102 -0.223 -1.52 46.1 100  -0.301* -2.93 44.0 
Tues 104 -0.137 -0.47 57.7 99 -0.120 -0.68 50.5 
Wed 101 0.029 0.20 58.4 100 0.251 1.74 60.0 
Thur 101 0.534 1.90 59.4 101 0.305* 3.08 64.4 
Fri 99 0.207 1.39 62.6 102 0.174 1.54 59.8 
Sat 25 0.383 1.09 56.0 55 0.258* 1.96 63.6 


In this and succeeding tables, * indicates that a coefficient is significantly 
different from zero at the 5% level, ** at 1% and *** at 0.1%. 


Figure 4: Days of the Week Effects in the ND Spot Market in Tokyo and the 
SIMEX Futures Markets in Singapore, Sept 3, 1986 to Sept 19, 1988 


Simex Futures 
HA ND Spot 
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Table 4: Days of the Week Effects in the ND Spot Market and the Osaka 50 
Spot Market in Tokyo and the Osaka 50 Kabusaki Futures in Osaka, 
June 6, 1987 to Sept 20, 1988 


Osaka 50 Futures Osaka 50 Spot 
Sample Mean % Sample Mean 
Day Size Return t-StatisticPositive Size Return t-Statistic % 
Positive 
All 352 -0.020 -0.30 49.4 353 0.000 0.00 52.1 
Mon 65 -0.329* -2.35 38.5 65 -0.347* -2.50 38.5 
Tues 62 0.205 1.20 54.8 63 -0.173 -0.62 60.3 
Wed 64 0.018 0.10 51.6 64 0.275 1.15 53.1 
Thur 63 -0.012 -0.09 50.8 63 0.173 1.31 55.6 
Fri 64 0.007 0.04 46.9 64 -0.025 -0.15 46.9 
Sat 34 0.018 0.08 58.8 34 0.190 0.90 64.7 
ND Spot 
Sample Mean % 
Day Size Return% t-Statistic 
Positive 
All 353 0.033 0.45 56.4 
Mon 65 -0.254 -1.93 46.1 
Tues 63 -0.196 -0.75 52.4 
Wed 64 0.318 1.57 59.4 
Thur 63 0.204 1.79 63.5 
Fri 64 0.059 0.39 56.3 
Sat 34 0.107 0.62 64.7 


Figure 5: Days of the Week Effects in the ND Spot Market and the Osaka 50 
Spot Market in Tokyo and the Osaka 50 Kabusaki Futures in Osaka, 
June 6, 1987 to Sept 20, 1988 
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The spot effect of weak Mondays and Tuesdays and strong Wednesday to 
Saturday is continuing. Wednesdays are the strongest days followed by 
Thursdays and Saturdays in the spot market. The Simex futures in Singapore 
look just like the spot market in Tokyo except in the Wednesday -Thursday 
period. Possibly this market is following rather than leading, although other 
evidence suggests the leading hypothesis. The futures seem to rise and fall 
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with the spot. The Osaka 50 seems to anticipate the days of the week effect. 
Saturdays hardly rise at all in the futures markets despite the spot gains thus 
anticipating Monday's fall. Then these futures fall on Monday to anticipate a 
further fall on Tuesday. They then rise on Tuesday to anticipate the gains on 
Wednesday, Thursday, Friday and Saturday. They are then flat until the fall 
the next Monday. 


3. HOLIDAY EFFECTS 


The main references on the holiday effect in U.S. spot markets are Ariel (1988), 
Lakonishok and Smidt (1989), Zweig (1986) and the survey in Ziemba (1990). 
These authors found a very strong effect: days before holidays have 
significantly higher returns than other days. The effect is relatively strongest 
for large capitalized stocks although the total preholiday returns is higher for 
the small stocks because of the small firm effect. Indeed for the 90 year 
Lakonishok and Smidt DJIA sample from 1897-1986 these eight or so days 
return more than half the non-dividend return over the whole year. Trading 
days before the preholiday and after the holiday do not have significantly 
higher returns. Indeed these days particularly the post holidays have lower 
returns on average. Ziemba (1989a) has investigated similar effects on the 
TSE. In Japan there are about twelve holidays each year compared with about 
eight in the U.S. For the Nikkei Dow from 1949-88, I found that the mean 
daily returns could be estimated by 


R = 0.0352 + 0.0799 Day_5 + 0.0222 Day.) + 0.1894Day_, - 0.0663 Day ,; + 0.00114 Day 


+2, 
(3.745) (1.491) (0.424) (3.709) (-1.334) (0.023) 


where Day-3 is the dummy for the separate effect of trading day -3, etc. The 
pre-holiday with an extra mean return of 0.1894% per day is statistically 
significant with a t-static of 3.709 and none of the other days have returns that 
are significantly different from bg. The effect of the +1 day is negative, 
although the coefficient is not significant. Their total returns are slightly 
positive, 0.0068% which is less than a typical day. In total, the preholidays 
return 0.2246 per day versus 0.049% on a typical non-preholiday with lower 
risk measured by the standard deviation, 0.794% versus 0.979%. The 
preholidays also improve every day of the week, for the Japanese stocks see 
Ziemba (1989a), just as Lakonishok and Smidt (1989) found for the DJIA. 


Figure 6 and Table 5 show the spot holiday effect on the ND and on the SIMEX 
futures market for the period of the SIMEX data, September 3, 1986 to 
September 19, 1988 some 557 trading days. The preholidays have gains on 78% 
of the days which is much higher than for the other days. The mean returns 
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are high, averaging 0.276% per day which is nearly four times the return on 
other days. However, the t statistic of 1.57 is not significant except at the 13% 
level. Moreover, the third day before the holiday has even higher returns. 
The other days around the preholiday have low returns as expected. 


In the futures markets the SIMEX anticipates the holiday effect on the 3rd day 
before the holiday. This anticipation seems to affect the spot market so the 
spot prices rise on -3 taking out some of the gains that would normally occur 
on the preholiday. The return in the SIMEX futures market on the preholiday 
is in fact slightly negative again reinforcing the fact that the preholiday effect is 
anticipated. 


Table 5: Holiday Effect on SIMEX Sept 1986-Sept 1988 


ND Spot on TSE ND Futures on SIMEX 
Sample Mean % Sample Mean 
Day Size Return% t-statisticPositive Size Returnn% t-statistic % Positive 
All 557 0.082 1.50 56.6 508 0.098 1.00 56.9 
3rd PH 21 0.370 1.38 57.1 17 0.684 1.62 64.7 
2ndPH 22 -0.083 -0.30 50.0 20 0.014 0.04 45.0 
PH 23 0.276 1.57 78.3 18 0.225 0.48 66.7 
AH 23 -0.001 0.00 56.5 21 0.445 1.25 61.9 
2nd AH 21 0.092 0.48 52.4 21 0.320 1.07 66.7 
Others 447 0.071 1.12 55.9 411 0.063 0.55 56.0 


Figure 6: Holiday Effect on SIMEX Sept 1986-Sept 1988 
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Figure 7 and Table 6 investigate the holiday effect from the Kabusaki Osaka 50 
index for the period June 6, 1987 to September 20, 1988. In the spot market on 
the TSE the effect is similar to the historical record. The preholiday returns 
average 0.50% per day and the index increases 75% of the time. The gain is 
significant at the 5% level. The third trading day before the holiday has high 
returns but they are not significant. In the futures markets the return is only 
significant at the 5% level on the preholiday. The returns were high around 
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the holiday periods in this sample with the days outside the pre and post 
holidays having negative returns. There is no evidence of the future market 
anticipating the holiday effect. 


Table 6: Holiday Effect on the Kabusaki 50 Sept 1986-Sept 1988 


Kabusaki 50 Spot on TSE Kabusaki 50 Future on TSE 
Sample Mean % Sample Mean 

Day Size Return t-StatisticPositive Size Return t-Statistic % Positive 
All 353 -0.001 0.00 52.1 352 -0.020 -0.30 49.4 
3rd PH 12 0.340 0.68 58.3 12 0.324 0.61 41.7 
2ndPH 11 0.021 0.04 45.5 11 -0.206 -0.42 45.5 
PH 12 0.499** 3.68 75.0 12 0.391* 2.27 66.7 
AH 12 -0.022 -0.07 50.0 12 0.284 1.03 50.0 
2nd AH 11 0.243 0.74 63.6 11 0.324 0.68 63.6 
Others 295 -0.043 -0.46 50.8 294 -0.070 -0.94 48.6 


In the spot market, the holiday effect for this sample of data of a year plus is 
not working in the usual fashion. The preholiday is strong but not dominant 
compared to the other days around the holiday. In fact for the SIMEX trading 
period the ND spot has risen most on the 3rd day before the holiday not on the 
preholiday. For the period that the Osaka 50 has been trading, the preholiday 
has the highest returns on the ND and these results are highly significant. 
The Osaka 50 also seems to have a strongly positive effect on the -3 preholiday 
trading day although these coefficients are not significant. So there may be 
some anticipation of the effect but with the small sample one cannot conclude 
much. 


Figure 7: Holiday Effect on the Kabusaki 50 Sept 1986-Sept 1988 
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4. TURN OF THE MONTH AND FIRST HALF OF THE MONTH EFFECTS 


The U.S. turn of the month effect occurs on trading days -1 to +4, see Ariel 
(1987) for an analysis of the spot data from 1963-82 and Sick and Ziemba (1989) 
for an analysis of futures data from 1982-88. The gains in this period are very 
high and contribute a very large percentage of the total monthly returns. The 
return in this week, coupled with that in the second week of the month, 
roughly trading days +5 to +8 or +10, essentially amounts to all the gains in 
the spot stock market during 1960-88. The rest of the month is essentially 
noise and at best provides zero returns. The reason(s) for this effect are not 
fully known but part of the story is that people receive their salaries on or 
around the -1 day as well as their stock account statements so they have funds 
to invest in stocks. There are also the portfolio balancing and renewal effects. 
When would one expect the turn-of-the-month to be in Japan? Many 
companies pay their salaries around the 25th of the month. Indeed, Ziemba 
(1989a) found similar turn of the month and first half of the month effects on 
the TSE. These effects are in the trading periods -5 to +2 and +3 to +7. For the 
ND from 1949-88 the period with trading days -5 to +2 receives the bulk of the 
month's return and each of these day's return is significant at the 5% level or 
better. Days +3 to +7 have positive returns and the rest of the month is noise; 
see Table 7 and Figure 8. Because of the pecularities of the Japanese market 
including Saturday trading day, +22 corresponds to a -5 day once a year or for 
39 times in that many years. Since Japan had more trading days because of the 
two or three Saturdays that were trading each month these periods are 
correspondingly lengthened. In the first half of the month the mean return is 
0.01142% per day while in the second half it is an insignifiant 0.00048% per 
day. 
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Table 7: Turn-of-the-Month Data, 1949-1988 


Trading Sample Mean t- Trading Sample Mean t- 
Da Si Return, % tatistic Day Si Return, % Statistic 
-5 471 -0899* 2.75 9 471 -1065* 2.46 
-4 471 -1041* 2.65 10 471 .0620 1.58 
-3 471 .1733"* 3.82 11 471 -0395 1.05 
-2 471 .1334** 3.18 12 471 -.0196 -0.50 
-1 471 -2255*** 5.36 13 471 .0115 0.24 
1 471 -0980* 2.43 14 471 -.0042 -0.10 
2 471 -1006* 2.47 15 471 -.0306 -0.54 
3 471 0307 0.72 16 471 .0716 1.35 
4 471 0592 1.15 17 471 -.0498 -1.20 
5 471 0358 0.81 18 429 -.0207 -0.49 
6 471 -.0005 -0.01 19 350 0162 0.36 
7 471 0357 0.09 20 229 -.0286 -0.48 
8 471 -.0585 -1.34 21 118 -.0476 -0.46 
22 39 .0562 0.45 


trading days 
ist Half 22-7 .01142 
2nd Half 8-21 00048 
Source: Yamaichi Research Institute reported in Ziemba (1989a). 


Figure 8: Turn-of-the-Month Data, 1949-1988 
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The turn of the month effect with high returns on days -5 to +2 seems to still 
be there but the SIMEX and Osaka 50 futures markets totally anticipate the 
effect. On the SIMEX, days -8 to -5 have all the gains of about 2.8% per month 
on average. This is a huge average gain in only four days. This is the whole 
turn of the month effect with the rest of the period about even. Although 
lower than in the futures markets, the spot market also has large gains on days 
-8 to -5. But the spot gains in the -5 to +2 period are still abnormally high. On 
the Osaka 50 futures market the anticipation seems to be on days -7, -6 and -5 
with a total gain of about 1%. Both markets lose back much of the gain on -4 
and -3. The effect in the spot market is similar to that mentioned above for 
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the ND. The SIMEX seems to lead the Osaka 50 futures and the spot ND in 
this effect. Table 8 and Figure 9 describe the results. Since there is only one or 
two turn of the years this data is lumped in with that for the other months. 
These were calculated separately, but this does not seem to change the results 


Table 8: The Turn of the Month Effect on the ND 


much. 
ND Spot on TSE 
Sample Mean % 
Day Size Return t-StatisticPositive 
-7 24 0.432* 2.17 66.7 
-6 24 -0.095 -0.40 62.5 
-5 24 0.333* 1.99 62.5 
-4 24 0.347* 2.00 62.5 
-3 24 -0.100 -0.40 54.2 
-2 24 0.315 1.10 70.8 
-1 24 0.161 0.79 50.0 
1 24 0.142 0.69 66.7 
2 24 0.689 0.31 54.2 
3 24 0.160 0.55 41.7 
4 25 0.261 1.61 68.0 
5 25 0.112 0.69 60.0 
6 25 -0.869 -0.64 40.0 
7 25 0.963 0.46 56.0 
8 25 -0.046 -0.02 56.0 
9 25 0.534* 2.50 76.0 
10 25 0.227 1.09 64.0 
11 25 -0.097 -0.47 56.0 
12 25 -0.341 -1.93 40.0 
13 25 -0.019 -0.10 48.0 
14 24 0.154 0.79 58.3 
15 24 -0.601 -0.92 54.1 
16 24 0.446 1.04 62.5 
17 24 0.022 0.92 62.5 


Sample 
Size 


ND Futures on SIMEX 
Mean 
Return 

22 0.668* 2.18 
21 -0.061 -0.19 
22 0.625* 1.96 
23 -0.356 -1.02 
18 0.445 0.15 
20 0.349 0.84 
18 -0.448 -0.90 
19 0.256 0.76 
21 0.314 1.32 
24 0.355 0.99 
21 0.103 0.45 
24 0.244 1.15 
20 -0.106 -0.51 
23 -0.438 -0.14 
25 0.252 0.97 
25 0.297 1.40 
24 0.358 1.75 
25 0.158 0.07 
25  -0.383* -2.12 
24 0.043 0.17 
22 0.933 0.35 
23 -0.923 -0.72 
21 0.012 0.07 
24 1.472 1.32 


t-Statistic % Positive 


68.2 
57.1 
54.5 
47.8 
61.1 
65.0 
33.3 
57.9 
61.9 
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Figure 9: The Turn of the Month Effect on the ND 
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8 15 -0.115 -0.41 53.3 
9 16 0.517 1.86 75.0 
10 16 0.493 1.75 81.3 
11 16 0.118 0.94 62.5 
12 16 -0.306* -2.16 43.8 
13 16 0.011 0.07 43.8 
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t-Statistic % Positive 


Osaka 50 Futures on OSE 
Sample Mean 
Size Return 
15 0.270 1.26 
15 -0.082 -0.21 
15 0.702 1.41 
15 -0.548 -1.45 
15 -0.253 -0.82 
15 0.131 0.27 
15 0.054 0.19 
15 -0.062 -0.16 
15 0.121 0.40 
15 0.120 0.24 
15 0.086 0.29 
15 0.049 0.19 
15 -0.305 -1.27 
15 0.117 0.28 
15 -0.037 -0.20 
16 0.361 1.62 
16 0.446 1.86 
16 0.028 0.18 
16 -0.259 -1.33 
16 -0.015 -0.07 
16 -0.274 -1.11 
14 0.007 0.02 
15 -0.068 -0.24 
15 -0.255 -0.69 
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Figure 10: The Turn of the Month Effect on the Osaka 50 


0.800 
0.600 
0.400 4 
0.200 4 


ES ratit; į E Osaka 50 Spot on TSE 
HO fa E 


0.000 + i a ‘ 

0.200 7785-3 -1 2 4 '6 8 1012 1431 -®- Osaka 50 Futures on 
pS , 3 s sp OSE 

-0.400 

-0.600 s 

-0.800 

-1.000 


5. FIRST HALF OF THE MONTH 


The effect that most or all of the gains on the TSE occur on trading days -5 to 
+7 seems to be working according to the script in the spot market. Itis as well 
in the Osaka 50 futures market. The SIMEX futures though seems to at least 
partially smooth out the returns. This first half has higher returns by a 50% 
margin but it does not have all the gains. The following strategy would have 
been profitable during the sample period: buy the SIMEX on the close of -9, 
sell on the open of -5; on the Osaka 50, buy on the close of -8 and sell on the 
open of -5. 


Table 10: Test of the hypothesis that first half of month has all the returns. 
half N Mean % Std Dev Std Error 
1 292 15 1.023 0.0598 
2 265 .008 1.355 0.0942 


The F test with 264 and 291 degrees of freedom indicates that with F=2.25 the 
variances are equal at the 0.1% level. The t test with equal variances then 
indicates that the first half of the month has all the gains with 19.24% 
confidence using a pooled variance with 555 degrees of freedom. 
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Table 11: The First Half of the Month Effect on the ND 


ND Spot on TSE ND Futures on SIMEX 
Sample Mean % Sample Mean 
Day Size Return% t-Statistic Positive Size Return t-Statistic % Positive 
All 557 0.082 1.50 56.5 508 0.098 1.00 56.9 
ist Half 292 0.150* 2.51 57.1 253 0.119 1.29 54.9 
2nd Half 265 0.008 0.08 55.8 255 0.077 0.45 58.8 


Figure 11: The First Half of the Figure 12: The First Half of the Month 
Month Effect on the ND Effect on the Osaka 50 
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Table 12: The First Half of the Month Effect on the Osaka 50 


Spot on TSE Futures on OSE 
Sample Mean % Sample Mean 
Day Size Return t-StatisticPositive Size Return. t-Statistic % Positive 
All 553 -0.000 -0.00 52.1 352 -0.020 -0.30 49.4 
Ist Half 180 0.034 0.37 52.8 180 0.018 0.17 48.9 
2nd Half 173 -0.036 -0.25 51.4 172 0.060 -0.71 50.0 


6. THE MONTHLY EFFECT 


Because of its length of about 30 days one would expect the futures markets to 
closely mirror the spot markets over a monthly period. Research on the spot 
market in Japan appears in Hawawani (1988b), the data of Horimoto (1988) 
reported in Ziemba (1989a) along with additional data reported there. These 
results are summarized in Table 13. 
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Table 13: Monthly Effects on the Tokyo Stock Exchange 


Hawawani (1988) Horimoto Ziemba (1989a) 
373-566 stocks (1988) ND 225 1949-88 
Jan 1955-Dec 1985 All TSE 1st Price Weighted 
Equally Weighted Section, 1965-87 
Equally 
Weighted 
Average Mean Mean Mean 
Monthly Sample Monthly t Monthly Sample Daily t 
Return Over Size __ Retur _ statistic Return Size Return _ statistic 
All months 372 1.60%* 6.40 1.76 11529 0.0482*** 5.56 
all but January 341 1.24%*** 4.75 1.42 na na na 
January 31 5.58%*** 13.17 5.44 885 0.1816*** 5.80 
February 31 1.00 1.85 1.71 905 0.0549 1.87 
March 31 2.90%** 3.77 2.88 986 0.0457 1.50 
April 31 0.51 0.57 0.82 947 0.0623* 2.10 
May 31 0.18 0.20 1.29 956 0.0074 0.27 
June 31 2.51%*** 4.59 2.83 1011 0.0641* 2.39 
July 31 0.65 0.75 1.12 1043 0.0083 0.27 
August 31 0.92 0.85 1.16 1038 0.0790** 2.96 
September 31 0.13 0.17 0.28 931 0.0059 -0.22 
October 31 0.52 0.48 0.17 998 0.0088 0.24 
November 31 1.81 1.76 1.11 910 0.0371 1.27 
December 31 2.48%** 3.02 2.27 919 0.0470 1.37 


January, March, June and December have had the highest mean returns in 
that order in both Hawawani's and Horimoto's samples. Also September and 
October had the lowest returns. Ziemba's longer data set shows similar very 
high returns in January, and the low September and October returns. But 
there are differences in the other months from the Hawawani and Horimoto 
samples. The sample sizes for Horimoto's data are unavailable. 


There is only one January in the Osaka 50 data and two in the SIMEX data but 
the daily data used do give us some results that are significant at the 5% level. 
Tables 14 and 15 and Figures 13 and 14 describe the results. 
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Table 14: The Monthly Effect on the ND Spot and the SIMEX 


401 


56.6 
69.0 
65.1 
64.6 
60.9 
55.8 
54.2 
57.1 
53.1 
57.4 
44.9 
57.1 


ND Spot on TSE ND Futures on SIMEX 
Sample Mean % Sample Mean 
Month Size Return t-StatisticPositive Size Return t-Statistic % Positive 
all days 532 0.094 1.00 56.8 557 0.082 1.50 
Jan 39 0.380* 1.97 61.5 42 0.396* 2.24 
Feb 40 0.226" 2.06 62.5 43 0.232* 2.26 
Mar 44 0.205 1.79 68.2 48 0.164 1.53 
Apr 41 0.254 1.34 63.4 46 0.272 1.87 
May 37 0.204 0.90 59.5 43 0.154 1.02 
jm 46 -0.003 -0.02 60.9 48 -0.032 -0.29 
Jul 48 -0.002 -0.01 50.0 49 0.064 0.40 
Aug 45 0.128 0.87 46.7 49 0.066 0.65 
Sep 57 -0.079 -0.47 54.4 54 -0.027 -0.20 
Oct 47 -0.242 -0.26 51.0 49 -0.283 -0.64 
Nov 43 0.204 0.85 58.1 42 0.134 0.65 
Dec 45 0.014 0.08 48.9 44 -0.066 -0.52 


Figure 13: The Monthly Effect on the ND Spot and the SIMEX 
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Table 15: The Monthly Effect on the Osaka 50 
Osaka 50 Spot on TSE 


ND Spot on TSE 
Sample Mean 


Month Size Return t-StatisticPositive 


Positive 
all days 353 0.033 
Jan 21 0.465 
Feb 20 0.284** 
Mar 24 0.167 
Apr 23 0.204 
May 21 -0.014 
Jun 40 -0.086 
Jul 49 0.064 
Aug 49 0.066 
Sep 36 0.020 
Oct 25 -0.351 
Nov 21 -0.118 
Dec 22 -0.226 


Osaka 50 Futures on OSE 
Sample Mean 


Month Size Retum t-Statistic 


Positive 

all days 353 -0.020 
Jan 21 0.358 
Feb 20 0.244* 
Mar 24 0.162 
Apr 23 0.170 
May 21 -0.280 
Jun 40 0.076 
Jul 49 -0.081 
Aug 49 0.097 
Sep 36 0.008 
Oct 25 -0.494 
Nov 21 -0.121 


Dec 22 -0.256 


% 
0.45 56.4 
1.42 66.7 
3.21 72.7 
1.20 58.3 
1.87 65.2 
-0.09 52.4 
-0.69 52.5 
0.40 57.1 
0.65 53.1 
0.15 61.1 
-0.43 56.0 
-0.31 47.6 
-1.08 36.4 
% 
-0.28 49.4 
0.95 52.4 
2.68 68.2 
1.04 50.0 
1.14 65.2 
-1.50 42.9 
-0.65 47.5 
-0.47 51.0 
0.86 46.9 
0.05 55.6 
-0.84 33.3 
-0.27 42.9 
-0.92 36.4 


Sample 
Size 


Mean 
Return 


-0.000 
0.410 
0.254* 
0.124 


t-Statistic % 


-0.00 
1.07 
2.40 
0.77 
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Figure 14: The Monthly Effect on the Osaka 50 
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7. TURN OF THE YEAR - JANUARY EFFECT 


The turn of the year effect based on the ND for 1949-88 is very strong. Days -7 
to +14 all have positive returns that average 0.142% per day for a cumulative 
return of about 3% in this twenty-one day period. There is only one turn of 
the year in the Osaka 50 data and only two in the SIMEX data. In the spot data 
for the thirty-nine years from 1949 to 1988 trading days -8 to +14 all have 
positive returns. Four of these days have gains in excess of 0.4% and seven 
others have gains of more than 0.2%. See Table 16 and Figure 15. 


For the 1987 and 1988 turn of the years there are modest gains but nothing like 
the long historical record. In addition many of the days have negative returns. 
The +2 and +3 days had a total return of about 9% on the Osaka 50. Volatility 
was very high. The Osaka 50 futures and spot prices are very consistent. All 
in all the Turn of the Year effect did not seem as strong as usual in 1987 and 
1988. Perhaps the October 1987 crash's effect on 1987 was a contributing factor 
to this. 
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Table 16: Rates of Return for the Thirty-Nine Turn of the Years, ND, 1949-1988 
and for the one turn of the year in the Osaka 50 Futures data set 
and the two turns of the year in the SIMEX data set 


ND Spot ND Spot ND Spot 
Trading Mean t- Osaka 50 SIMEX SIMEX Osaka Osaka 50 
Day _ Return, % _Statistic Futures Futures data data Spot 
-15 0.0292 0.24 -0.411 0.110 -0.424 -0.273 -0.411 
-14 -0.0603 -0.38 1.305 0.705 1.330 1.727 1.516 
-13 -0.2631 -1.29 -1.265 -0.236 -0.646 -1.053 -1.535 
-12 0.2210 0.72 0.083 0.279 0.028 -0.476 -0.680 
-11 -0.1036 -0.90 0.413 0.117 -0.284 -0.347 0.259 
-10 0.0202 0.09 -0.165 -0.059 0.324 -0.120 -0.176 
-9 -0.1197 -0.54 0.577 1.176 -0.049 0.352 0.855 
-8 0.0347 0.17 0.000 -0.575 -0.488 -0.318 -0.449 
-7 0.1413 0.76 0.328 0.553 0.835 0.569 0.868 
-6 0.1738 1.31 -0.449 -0.510 -0.746 -0.940 -1.018 
-5 0.1303 1.08 -0.164 -0.266 -0.149 -0.207 -0.184 
-4 0.4683*** 4.48 -0.534 0.063 -0.081 -0.663 -0.327 
-3 0.4413* 2.49 -2.188 0.000 -1.277 -1.874 -2.412 
-2 0.0613 0.39 -3.757 -5.795 -1.204 -2.656 -3.239 
-1 0.2457* 2.01 -0.263 -0.550 -0.246 0.142 0.917 
1 0.0369 0.22 -1.671 1.653 -0.486 -1.609 -2.329 
0.1539 0.91 2.683 2.463 1.153 1.689 1.553 
3 0.3812 1.92 6.272 -0.138 2.567 5.633 6.657 
4 0.246 1.46 -0.656 0.202 -0.165 0.007 -0.312 
5 0.4067* 2.74 0.248 -1.223 0.261 0.353 -0.184 
6 0.0856 0.68 -1.482 -0.473 -1.019 -1.286 -1.389 
7 0.2546 1.51 -0.084 -0.420 -0.232 0.207 0.335 
8 0.0985 0.74 -1.212 2.119 -0.015 -1.326 -1.494 
9 0.4007* 2.91 1.185 0.766 1.597 1.248 1.302 
10 0.2736* 2.50 1.589 0.186 0.780 1.356 1.443 
11 0.2243* 1.95 -0.865 -0.235 0.047 -0.053 -0.144 
12 0.0375 0.32 -0.332 -0.456 0.432 -0.240 -0.119 
13 0.1101 0.98 -0.917 na -0.416 -0.582 -0.357 
14 0.2180" 2.04 0.883 1.007 0.563 0.732 0.285 
15 -0.0334 -0.14 0.542 0.658 0.757 1.087 1.157 


O Petry itp 1 
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Day of the Week Effects in Japanese Stocks 
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Many of the anomalies in the U.S. markets also occur in Japan. Although literature 
on Japanese stock market anomalies is just starting to be written, we do have some 
good independent studies of interday and day of the week, small firm, and January 
effects, as well as the development of data bases and arbitrage pricing equations. In 
this paper, we survey several studies on the day of the week effect.! Other anomalies 
are discussed in Ziemba (1989ab). For other aspects of the Japanese stock market 
see Ziemba (1989c) and Ziemba and Schwartz (1990). 

Jaffe and Westerfield (1985ab) investigated the day of the week effect in the 
Nikkei Dow, a price-weighted index of 225 large capitalization securities traded on 
the first section of the TSE and the TSE 1000 (at that time), a value-weighted index 
of all stocks on the first section, called TOPIX. 

The Nikkei Dow index is computed in a fashion similar to the Dox Jones Indus- 
trial Average. It is a price-weighted index computed by adding the prices of the 225 
component stocks in the index and dividing by a divisor that changes over time 
due to stock splits, rights issues, etc. At the end of December 1988, the divisor 


*Without implicating them William T. Ziemba would like to thank his colleagues, particularly 
A. Komatsu and H. Shintani at the Yamaichi Research Institute for their help and useful discussions 
on anomalous behavior in Japanese security markets. This research was supported by the Yamaichi 
Research Institute and by a grant from the Centre for international Business Studies. University 
of British Columbia. Willaim T. Ziemba’s research on this topic was conducted at the Yamaichi 
Research Institute and he thanks the institute for permission to publish the new results. This 
paper is a modified version of a chapter in Ziemba and Schwartz (1990). 
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that had originally been 225 was 10.287. The index is calculated and announced 
at one-minute intervals during trading hours. At the end of December, 1988, the 
Nikkei Dow closed at its high of the year which was 30,159. 

The Nikkei Dow is a theoretical number and should be distinguished from the 
real average price in the marketplace. The meaning of the level ¥30,159 is as follows. 
Supposing one had bought one share of each of the 225 component stocks listed 
on the Tokyo Stock Exchange at the time of the post-war inauguration of the 
exchange (May 16, 1949), then held them until December 17, 1988, without putting 
additional funds into new shares issued through rights offerings to shareholders. 
Instead one had reinvested the value of every right evenly in all of the component 
stocks. The average price would then have increased from ¥176.21 on the date 
of the original investment on May 16, 1949, to ¥30,159 or about 171 times, not 
counting dividends and taxes. In U.S. dollars the increase is about 495 times since 
the yen/dollar exchange rate was 360 in 1949, closed at 124.58. The average is 
compiled by summing the value of the 225 stocks and dividing by the current divisor. 
The divisor is adjusted by the formula: 


Price Totals Before Ex-Rights Date 
Old Divisor 
Price Totals Before Ex-Rights Date — Value of Rights 
~ New Divisor 
Every time a component stock goes ex-rights, the divisor is reduced. 

The ND suffers from the same criticisms as the DJIA. It is not a true represen- 
tation of the real market since it only has a small percentage of all the stocks on 
the TSE and in addition the price-weighting scheme is not as accurate a measure 
as a value-weighting scheme. In the words of the Tokyo Stock Exchange (1988): 


The Tokyo Stock Exchange stopped computing its Dow formula based 
average and replaced it with TOPIX on July 1, 1969, as the Exchange 
found the average inappropriate for measuring the market for the fol- 
lowing reasons: Firstly, investors are apt to confuse the average with the 
real price, despite the fact that the average is a theoretical arithmetic 
mean enlarged by applying a multiplier. Secondly, as the average is not 
weighted, it tends to be affected by price changes in a handful of smaller 
companies’ stocks trading on high prices. Thirdly, as component stocks 
of the average have virtually been fixed since 1949, except for occasional 
substitutions caused by delistings and company mergers, the average 
does not reflect the change in industrial structure and therefore does 
not serve as an appropriate measure of the market as a whole. On order 
to eliminate all of these problems, the Exchange developed and began 
computing and publishing TOPIX on July 1, 1969. 


TOPIX value-weights all 1135 stocks (at the end of December 1988) on the First 
Section of the TSE. The index is simply the value-weighted average of all the stocks 
on the TSE. Adjustments are made to any corporate activities that affect the current 
market value, other than price changes, via an adjustment to the base value, such 
as new listings, assignment of stocks from the Second Section to First Section and 
vice versa, delisting, rights offerings, public offerings, private placements, mergers, 
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exercises of stock, subscription warrants, or conversion of convertible bonds or pre- 
ferred stock into common stock. In contrast to the actions above, a corporate deci- 
sion that entails no change in the market value of shares of the company has nothing 
to do with the basic market value adjustment. Stock splits, capitalization issues, 
bonus issues, and stock dividends are thus eliminated from the adjustment, as the 
new stock price multiplied by the increased (or decreased) number of shares is the- 
oretically the same as the old stock price multiplied by the old number of shares. 
The formula for the adjustment is 


New Market Value 


New Base Market Value = Old Base Mark l ——— 
ew Base Market Value = O ase Market Value x Old Market Value 


By doing so it is a better measure of overall market movements than the S&P 500 in 
the United States. The corresponding index measures in the United States are the 
NYFE index of all stocks on the New York Stock Exchange or the broader-based 
Wilshire 5000 index. 

TOPIX is computed and published via the TSE’s Market Information System 
every 60 seconds. It is reported to securities companies across Japan and is available 
worldwide through computerized information networks. TOPIX’s initial value was 
set at 100 on July 4, 1968 and its value at the end of December was 2357.03. For 
dates before 1968 TOPIX traded below 100 — in fact it was around 10 near the 
start of the new phase of the TSE in 1949. Currently, the Nikkei Dow has a beta of 
0.84 relative to the TOPIX’s market index of 1.00. Large capitalized stocks (over 
200 million shares outstanding) have a beta of 1.16 and small capitalized stocks 
(less than 60 million shares outstanding) have a beta of only 0.37. The latter is an 
underestimate because of the thin trading of many of these stocks and the fact that 
the TSE’s beta estimates do not use the Dimson or Scholes- Williams procedures for 
estimation in such circumstances. 

Using data from January 5, 1970, to April 30, 1983, on the Nikkei Dow (ND), 
TOPIX and the S&P 500, Jaffe and Westerfield (1985b) found that there was a 
substantial day of the week effect. This is detailed in Table 13.1. 

The United States results indicate typically sharp average losses on Mondays, 
with large gains on Wednesdays and Fridays, and modest gains on Tuesdays and 
Thursdays. In Japan, there are small losses on Mondays and more substantial losses 
on Tuesdays. In sum, Japan’s Tuesday losses are more or less in the ballpark of 
Monday’s United States losses. One would suspect that these losses would be highly 
correlated, especially since Japan’s trading day starts 14 hours before New York’s. 
But, this was not the case during 1970-83; the correlation between Monday’s return 
in New york and Tuesday’s return in Japan is only 0.076 for the ND and 0.004 for 
the TOPIX. 

Corresponding to the Wednesday and Friday high mean returns in the United 
States are large average gains in Japan on Wednesdays and Saturdays. These gains 
are about three times the average daily gain of 0.038% and 0.035% on the ND and 
TOPIX, respectively. Jaffe and Westerfield looked at leads and lags of up to five 
days throughout the week. The lead and lag effects are minuscule with the lead +1 
correlation being 0.072 and 0.086, respectively, for the ND and TOPIX. The same 
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Table 13.1: Percent of Returns on Country Common Stock Indexes by Day of Week. 


Mon Tues Wed Thurs Fri Sat All Days 
U.S. S&P 500 1970-83 
Mean —0.129 0.020 0.097 0.032 0.078 0.021 
Std. dev. 1.015 0.883 0.924 0.823 0.827 0.889 
Japan ND 1970-83 
Mean —0.020 —0.090 0.150 0.026 0.063 0.115 0.038 
Std. dev. 0.876 0.788 0.815 0.875 0.788 0.668 0.817 
Japan TOPIX 1970-83 
Mean —0.014 —0.064 0.124 0.026 0.057 0.099 0.035 
Std. dev. 0.701 0.684 0.671 0.741 0.626 0.530 0.672 


Returns are computed as r: = [(Vi/Vi_1) — 1] x 100, where V; is the value of the country index at 
the end of day t. 
Source: Jaffe and Westerfield (1985b). 


Table 13.2: Percent of Returns on the TOPIX Stock Index by Day of Week. 


Mon Tues Wed Thurs Fri Sat All Days 
Jan. 1, 1977—Dec. 31, 1986: Ikeda (1988) 
Mean 0.039 —0.110 0.153 0.028 0.082 0.138 0.052 
t-value* 1.61 —4.21 6.11 0.95 3.36 6.08 4.88 
April 4, 1978-June 18, 1987: Kato (1988a) 
Mean 0.0039 —0.0902 0.1449 0.0648 0.1049 0.1397 0.0581 
t-value 0.13 —3.08 4.95 2.22 3.60 4.11 4.72 
April 4, 1978—Dec. 31, 1981: Kato (1988a) 
Mean 0.0007 —0.0852 0.1283 —0.0273 0.0834 0.0957 0.0302 
t-value 0.02 —2.58 3.89 —0.83 2.54 2.54 2.17 
Jan. 1, 1982—June 18, 1987: Kato (1988a) 
Mean 0.0061 —0.0935 0.1561 0.1279 0.1195 0.1707 0.0771 
t-value 0.14 —2.15 3.58 2.92 2.74 3.33 4.19 


* All returns are significantly different from zero at the 5% level, except all Monday results and 
Thursdays during 1978 to 1981. 


day correlations are larger being 0.163 and 0.154, respectively. Still it is a minor 
relationship during this time period. 

More recent studies of the day of the week effect in Japan were done by Ikeda 
(1985, 1988), in Japanese, Kato and Schallheim (1985), and by Kato (1988ab). The 
results of Ikeda (1988) and Kato (1988a) are summarized in Table 13.2. These stud- 
ies are based on the TOPIX index which now has 1,135 securities (end of December 
1988 count) on the entire first section of the TSE, for the Periods January 1, 1977, 
to December 31, 1986, and April 4, 1978, to June 18, 1987, respectively. Kato also 
split his data into subperiods from April 4, 1978, to December 31, 1981, and from 
January 1, 1982, to June 18, 1987. In general, these results are consistent with and 
reinforce those of Jaffe and Westerfield. 
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Kato (1988a) tested but did not report results from the Nikkei Dow since they 
were similar to his TOPIX findings. Monday is more or less even, Tuesday is quite 
negative, Wednesday and Saturday are strongly positive, and Thursday and Friday 
are moderately positive. One change in Kato’s study is that Thursdays became very 
positive during 1982-1987, and in this later period they nearly rival Wednesday and 
Saturday. 

The distribution of returns is weighted a bit more to the right on Mondays and 
on Wednesdays to Saturday, and to the left on Tuesdays than a normal distribution 
would be. Kato’s (1988a) data yield the visual display of this in Figure 13.1. 

Kato (1988a) also investigated the intraday and close-to-open effects. Data on 
the TOPIX for each 15 minute period of trading are publicly available in Japan in 
the Japanese Securities newspaper, Nihon Shoken Shimbun. Trading occurs during 
9-11 a.m. and 1-3 p.m. on weekdays and 9-11 a.m. on Saturdays. During the period 
of this study there was trading on Saturday on the first and fourth weeks of the 
month and fifth if there is one. Historically, there was trading on all Saturdays until 
the end of 1972. Then until July 1983 it was closed on the third Saturday, and later 
the second Saturday closed from August 1983-July 1986. Saturday trading stopped 
at the end of January 1989. The trading time lost will be made up with additional 
trading on December 29 and 30 and with an additional half hour of trading in the 
afternoon session which now begins at 12:30. 
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Figure 13.1: Histograms of Returns of the TOPIX, in %, 1978-1987. 
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Figure 13.2: Cumulative Mean Intraday Returns in % for the TOPIX, January 1, 1982—June 1989. 


A visual display of the interday effects appears in Figure 13.2 for the period 
January 1, 1982, to June 19, 1987. We see that all days have in initial surge, except 
for Tuesdays which start out flat. Wednesdays to Fridays are — remember this is all 
averages; if only the market could be this straightforward — roughly flat until they 
have a kick at the end of the day, similar to what Harris (1986) found on the NYSE. 
Mondays lose their initial gains and drift negatively except for the final kick which 
puts them about even. Tuesdays have a pattern similar to Monday but without the 
early gains. They close down about —0.15% despite the final kick. Saturdays are 
very positive throughout the day. They open up and continue increasing until the 
final bell. Interestingly, the nontrading periods are all positive; gains occur during 
the night and the market opens up, even on Tuesdays. Table 13.3 shows Kato’s 
results including the intraday returns using the previous close to the open as the 
starting point. Except for Mondays, which are strongly negative, not much happens 
during the lunch break, although Kato’s data for this period has 15 minutes of 
trading in it so this conclusion is a little muddled. 

The effects of Saturday trading are very interesting. First, Saturdays are 
extremely positive. Second, Saturdays are even more positive if the previous 
week had Saturday trading — at present during the first week of the month 
(see Table 13.4). Third, Mondays are negative if the previous Saturday is closed 


Period 

Close to close 
Close to open* 
Open to close 
9:15-10 a.m. 
10:00-11 a.m. 
11:00-1:15 p.m. 
1:15-2:00 p.m. 


2:00-3:00 p.m. 


*The market opens a 


Trading, Nontrading, and Intraday Returns in % for the TOPIX January 1, 1982—June 18, 1987. 


Friday 


0.1212 
(2.38) 
0.1134 
(5.88) 
0.0069 
(0.16) 
0.0237 
(1.47) 
—0.0627 
(—3.11) 
0.0019 
(—0.16) 
0.0070 
(0.48) 
0.0407 
(2.29) 


Saturday 


0.1983 
3.28) 
0.0487 
2.13) 
0.1490 
2.89) 
0.0278 
1.45) 
0.1210 
5.05) 


All Days 


0.0723 
(3.35) 
0.0855 
(10.47) 
—0.0135 
(—0.74) 
—0.0016 
(—0.23) 
—0.0330 
(—3.84) 
—0.0070 
(—1.295) 
—0.0175 
(—2.65) 
0.0473 
(5.90) 


Table 13.3: 

Statistic Monday Tuesday Wednesday Thursday 
Mean —0.0213 —0.1331 0.1792 0.1201 
T-Value (—0.41 (—2.60) (3.52) (2.35 
Mean 0.1002 0.0013 0.1177 0.1211 
T-Value (5.08 (0.07) (6.11) (6.26 
Mean —0.1217 —0.1344 0.0611 —0.0017 
T-Value —2.73 (—3.07) (1.41) (—0.04 
Mean —0.0040 —0.0671 0.0114 0.0065 
T-Value —0.24 (—4.13) (0.70) (0.40 
Mean —0.0543 —0.0589 —0.0457 —0.0546 
T-Value —0.26 (—2.90) (—2.26) (—2.69 
Mean —0.0514 —0.0097 0.0232 0.0029 
T-Value —4.22 (—0.81) (1.95) (0.24 
Mean —0.0516 —0.0251 —0.0160 —0.0035 
T-Value —3.44 (—1.71) (—1.09) (—0.24 
Mean 0.0387 0.0215 0.0885 0.0466 
T-Value (2.12 (1.20) (4.97) (2.62 

9 a.m., Kato’s close to open is from the previous day’s close to 9:15, a period when most stocks have opened. 
** These F statistics all indicate that the regression model used to fit these returns, namely 


where Di, = 1 for Mondays. D21 = 1 for Tuesdays, 


Source: Kato (1988a) 


a 
Rı = So BiDu + Ey 


i=l 


..., D61 = 1 for Saturdays, and zero otherwise, is significant at the 1% level. 


F 


6.88** 


23.58** 


4.54** 


3.67** 


10.46** 


4.47** 


3.24** 


8.55** 
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Table 13.4: Effects of Saturday Trading on the TOPIX April 4, 1978-June 18, 1987. 


Category Trading Ends 


Week of Month This Week Last Week Monday Tuesday Wednesday Thursday Friday Saturday F Sample Size 

Ist, 5th Al Sat Sat 0.0198 —0.1072 0.1790 0.0581 0.0605 0.1678 8.67** 1406 

4th A2 Sat Fri —0.1008 —0.1102 0.1074 0.0306 0.1582 0.0793 2.53* 649 

2nd B1 Fri Sat 0.1135 —0.0649 0.0980 0.0933 0.1341 4.05* 539 

3rd B2 Fri Fri —0.3489 0.1931 0.2479 0.2609 0.2193 0.46 58 
All Weeks 0.0039 —0.0902 0.1449 0.0648 0.1049 0.1397 

Sample Size 449 464 464 465 467 343 2652 


*At the 5% level and similarly in succeeding tables and **indicates significance at the 1% level. 
Source: Kato (1988a). 
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(the third and fourth weeks) and more so if this week has no Saturday Trading, that 
is, the third week. However, Mondays are positive if Saturday had trading and espe- 
cially so if this week has no Saturday trading, that is, the second week of the month. 

Before we continue snooping into this data, let us pause to see if this makes 
any sense. We know from U.S. data that there is strong one-day serial correlation of 
returns, so if Saturday is positive (and we need it to be open to be so), then Monday 
likely will be positive and we will have the usual negative Tuesday. Suppose Saturday 
is closed, then the Friday is stronger than usual and somehow the long weekend gives 
traders more time to think about selling. Then with Monday so negative, Tuesday 
can be positive but only in weeks that end with Friday trading. For weeks ending 
with Saturday trading the loss is more or less split over the Monday and Tuesday. 

When the anomalous effect does not show up as expected, plan for it for to be 
pushed around a little. The Friday-Friday weeks have the most negative Monday 
following the first Friday, and then the rest of the week is very strong. Is it the sight 
of a positive Tuesday giving the market life? We do not know, but the effect seems 
to be there. Kato’s data covers a period of more than nine years with differing rules 
on Saturday trading so that may affect the results as well. 

Table 13.4 lists the weeks of the month corresponding to the current trading 
data rules so that you can data snoop some more. Kato (1988a) also plots the 
intraday patterns by day of the week using the 1982-1987 data; they have similar 
interpretations. (See Figure 13.3). Kato has a little more information on this, namely 
the effect of the New York Dow on the Tokyo TOPIX. He researched this with and 
without Saturday to explain, in particular, the switching of the negative Tuesdays 
to Mondays when there is no Saturday trading. This is discussed in what follows. 

Ikeda (1985, 1988) and Kato (1988a) discuss the settlement hypothesis as a 
possible explanation of the day of the week effect. The idea is that on certain days 
the time that one has to pay for stock or receive money for sales is different, and 
thus there should be premium for the additional interest foregone or earned from 
this process. The delivery of securities and receipt of payment takes place on the 
third business day after the transaction. See Figure 13.4 for rules on this settlement. 
The results found by Kato and Ikeda, like those in North America, are mixed. This 
is a partial explanation of the day of the week phenomenon. However, the returns on 
Tuesdays are negative except they are strongly positive in the third week, which has 
no Saturday trading, following a week without Saturday trading days is inconsistent 
with this theory. If we let R be the rate of return on the stock in question and i be 
the cost of capital, then the differences in return across days of the week, with and 
without Saturday trading, should be as follows, see Ikeda (1985, 1988) — assuming 
that there are no holidays in this period: 


Mon Tues Wed Thurs Fri Sat 


Saturday Trading 2R+% R+i R+3t R R+i R+i 
No Saturday Trading 3R+i R+i R+3t R+i R+i 


Hence the returns should be highest on Mondays and Wednesdays, with or without 
Saturday trading. The returns should be the same on Tuesdays, Thursdays in weeks 
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Figure 13.3: Cumulative Mean Intraday Returns by Four Group Classified by Saturday Trading, 
TOPIX, 1983-1987. 
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Figure 13.4: Settlement Procedure in the Tokyo Stock Exchange. 
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Figure 13.4: (Continued) 


without Saturday trading, Fridays and Saturdays. One very important implication 
of the settlement rates is that stocks can be purchased on Wednesdays, sold on 
Thursdays and have settlement on the same day, Monday.” This is widely believed 
to have a positive effect on Wednesdays something that is consistent with the data. 
As we can see from the previous results, some of this is consistent with the facts but 
it is far from a proper explanation of the day of the week phenomenon. See Ikeda 
(1988) for some calculations adjusting the returns for this theory. He concludes 
that once adjustment for these settlement differences are made, the day of the week 
differences are less anomalous than before in a statistical sense. 

Since the day of the week effect casts doubt on the homogeneity of the return- 
generating process across the week, not only the mean but higher moments of the 
return distribution should be taken into consideration for each day of the week. Ikeda 
(1988) examined the skewness and kurtosis, and also presented a test regarding “the 
stability under addition property to reject a stable Paretian distribution hypothesis. 
He found that when the interval used for calculating returns is extended from a day 
to a week, a week to two weeks, two weeks to a month, et cetera, the kurtosis and 
skewness become less significant to reject normality, and that leads to use of the 
mixture of normal distributions (or subordinated stochastic process) for the return 
generating process of Japanese stocks. He also presented a model which relates the 
mixture of normals hypothesis to the “information generating process” and applied 
that model to the explanation of the day of the week effect. 

Kato (1988a) re-examined the earlier study by Jaffe and Westerfield (1985b) 
on the correlation between the New York and Tokyo markets with his more recent 
data. For the period 1980-1987, he found much higher correlations, namely 53.6%, 
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Table 13.5: Correlation Coefficients Between the Dow Jones Industrial Average and the Nikkel 
Dow, 1980-87. 


New York 
Tokyo (t = 0) 

New York’s Effects (t = —1) Close-to-open F 0.3306 0.2817 0.3721 
on Tokyo Open-to-close r 0.3823 0.1091 0.2453 
Close-to-close r 0.5363 0.2551 0.4313 
Tokyo’s Effects (t=0) Close-to-open r 0.1556 0.0565 0.1096 

on New York (0.0138) 
Open-to-close r —0.0057 0.0591 0.0484 
(0.8031) (0.0348) 
Close-to-close r 0.0887 0.0929 0.1145 


All coefficients are significant at 1% or lower level except those with p values in ()’s. So the only 
effect that is statistically zero is Open-to-close in New York on t = 0 versus Close-to-open in Tokyo 
on t = 0 which had a small negative correlation. 

Source: Kato (1988a). 


between the previous day’s return in New York and the close-to-open returns in 
Tokyo, using the Dow Jones Industrial Average in New York and the Nikkei Dow 
in Tokyo. This is not surprising since the two markets have grown over time to 
watch the other’s movements very closely. The previous day’s return in New York 
is the most relevant information leading into Tokyo’s opening. The reverse effect, 
namely the correlation between Tokyo’s close-to-close and New York’s close-to-open 
on the next trading day is only 10.96%. So Tokyo’s effect on New York is less than 
New York’s effect on Tokyo. Other correlations are smaller for each directional effect, 
but still statistically significant. Kato’s results are summarized in Table 13.5. Since 
this data covers a relatively long period, 1980-87, one would expect many changes 
in these correlations over time. Additional results on this appear in Hamao, Masulis 
and Ng (1989). 

The trading or nontrading on Saturday coupled with the effects of New York 
trading on Tokyo, provides a method to investigate more fully why there are such 
different Monday and Tuesday effects in these two circumstances. To investigate 
this Kato (1988a) ran the following regression. 

6 
Rot — Rnyt-1 = 5 BkDkrt + € 
K=1 
where Rr, is the return in Tokyo on day t and Ryy,r—1 is the return in New York on 
the previous day, the (’s are the coefficients, and the D;;’s are “dummy” variables to 
pick up the day of the week effects. That is, if t is a Tuesday then Dy; = 0, Da = 1, 
D3, = --- Det = 0, et cetera. and e; is the error in this equation. He found that when 
there is Saturday trading, the F statistic for the fit is 0.32, which is not significant 
even at the 10% level. However, when there is no Saturday trading the F statistic is 
3.48, which is significant at the 1% level. This fit is caused mainly because of the low 
Monday return on the Nikkei Dow relative to the high return on Friday’s New York 
Dow. When there is Saturday trading and Monday is excluded from Tokyo’s data, 
that is, New York is MT'WThF and Tokyo is TWThFS, then the hypothesis that 
the weekly patterns of both markets are identical cannot be rejected. In particular, 
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the low Monday in New York lines up with the low Tuesday in Tokyo, and the high 
Friday in New York matches with the high Saturday in Tokyo. The other days are 
correspondingly similar. Hence, whatever reasons are the true causes for the day of 
the week effect in New York, they are probably similar to those in Tokyo. Perhaps 
in weeks with Saturday trading, investors do not have enough time to vacation and 
contemplate their portfolios leading to Monday sales, so the fall is on Tuesday. See 
Miller (1988) on this. 

The studies discussed above utilize data from the 1970s and 1980s to study 
the day of the week effects, and they aggregate the results over large subsets of 
these periods. Hence, the results are smoothed out and do not show yearly differ- 
ences. Figure 13.5, abcd show the effects year by year for both the TOPIX and the 
Nikkei Dow indices for the 40 years from 1949 to 1988. There is considerable vari- 
ation over time but in general the previous results of low or negative Monday and 
Tuesday returns and high Wednesday to Saturday returns are supported by this 
data. Tuesday was consistently negative from 1973 to 1987. Tuesdays have been 
slightly positive so far in 1988. Mondays are negative or slightly positive through- 
out the whole 40 years. Wednesdays became positive in 1970 and have remained 
positive during each of the past 19 years at levels of about 0.2% per day. Thursdays 
show great variation over the entire sample period with wide swings in average 
returns from year to year, switching form highly positive to negative returns and 
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Figure 13.5: Mean Return by the Day of the Week. 


334 Calendar Anomalies and Arbitrage 


back to positive returns in short periods. Recently they have been more positive. On 
the TOPIX each year, since 1983 shows positive average returns reaching levels of 
0.2% to 0.3% per day in 1986 and 1987. For the Nikkei Dow the returns on Thursday 
are not positive until 1985. Since then they have been very positive and they have 
provided average gains in every year on both indices since 1974. Saturdays are even 
more positive with gains throughout the 40-year sample period except for a brief 
period in the mid-1960s on the Nikkei Dow. 

Table 13.6 investigates the returns during the past four decades in the Nikkei 
Dow by day of the week. We see that: (a) Mondays and Tuesdays were negative 
in the 1950s, with Wednesday, Thursday, and Saturday having progressively higher 
returns. Saturdays returned over four times the average daily returns and provided 
gains nearly two-thirds of the time; (b) in the 1960s, Mondays were still negative 


Table 13.6: Returns by Day of the Week by the Nikkel Dow, by Decade, 1949-1988. 


Standard Percent 
Sample Mean Deviation of of Positive 
Day Size Return % Return % t-Statistic  p-Statistic Returns 

1949-1959 

1 All 3193 0.0565 1.1189 2.85 0.0044 53.6 
2 Mon 531 —0.1545 1.1544 —3.08 0.0022 43.3 
3 Tue 531 —0.0284 1.1120 —0.59 0.5571 50.3 
4 Wed 534 0.0957 1.1860 1.87 0.0627 53.0 
5 Thu 531 0.1488 1.277 2.68 0.0075 56.9 
6 Fri 535 0.0467 1.0986 0.98 0.3258 53.3 
7 Sat 531 0.2302 0.7849 6.76 0.0001 65.5 
1960-1969 

1 All 3000 0.0368 0.8675 2.32 0.0201 52.8 
2 Mon 501 —0.0396 0.8767 —1.01 0.3119 50.5 
3 Tue 499 0.0643 0.8500 1.69 0.0915 50.5 
4 Wed 501 0.0004 0.9075 0.01 0.9919 50.7 
5 Thu 501 0.1221 0.9705 2.82 0.0050 56.7 
6 Fri 495 0.0160 0.8973 0.40 0.6911 51.9 
7T Sat 503 0.0575 0.6686 1.93 0.0543 56.7 
1970-1979 

1 All 2892 0.0391 0.8560 2.46 0.0140 52.3 
2 Mon 488 —0.0135 0.9182 —0.33 0.7453 54.1 
3 Tue 502 —0.0968 0.8129 —2.67 0.0079 44.4 
4 Wed 494 0.1702 0.8454 4.47 0.0001 61.7 
5 Thu 499 0.0091 0.9497 0.21 0.8307 54.5 
6 Fri 496 0.0660 0.8412 1.75 0.0810 55.6 
7 Sat 413 0.1136 0.7018 3.29 0.0011 62.5 
1980-1988 

1 All 2444 0.0625 0.8242 3.75 0.0002 56.0 
2 Mon 417 —0.0135 0.7585 —0.36 0.7169 57.3 
3 Tue 427 —0.1183 1.0205 —2.40 0.0170 43.8 
4 Wed 431 0.2151 0.8696 5.14 0.0001 63.3 
5 Thu 428 0.0606 0.7797 1.61 0.1088 52.3 
6 Fri 433 0.0944 0.7669 2.56 0.0108 58.0 
7 Sat 308 0.1601 0.5933 4.74 0.0001 63.6 


Source: Yamaichi Research Institute. 
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but Tuesdays were slightly positive. Thursdays were again strong but Wednesdays 
were essentially even. Saturdays had gains but only 50% higher than all days com- 
pared to 350 + % higher in the 1950s; (c) Mondays and Tuesdays were negative 
in the 1970s with Tuesdays now switching to be the more negative day by a 7 
to 1 ratio. Only 44% of Tuesdays had positive returns versus 55% on all days. 
Wednesdays replaced Thursdays as the strong midweek day and Saturdays were 
again very strong. Both Wednesdays and Saturdays had positive returns on more 
than 60% of the time; (d) the 1980s are quite similar to the 1970s with very negative 
Tuesdays, mildly negative Mondays, stronger Thursdays (than the 1970s), as well 
as stronger Fridays. Wednesdays again were the strongest days, returning 0.215% 
on average and providing positive returns 63.3% of the time. Saturdays were again 
very positive as well, returning 0.16% per day and being positive 63.6% of the time. 
The Nikkei Dow results are, not surprisingly, quite similar to the TOPIX results for 
the 1970s and 1980s discussed earlier. 

Obviously many of the anomalies affect one another and interact. Kato (1988a) 
has investigated the interaction of the January and small firm effects by days of the 
week to attempt to isolate the separate influences of these anomalies using TOPIX 
data from January 4, 1974, to June 8, 1987. His results are dialed in Tables 13.7 


Table 13.7: Mean Portfolio Close-to-close, Close-to-open, and Open-to-close Returns by Day 
of the Week and Firm Size on the TOPIX, January 4, 1974—June 18, 1987. 


Firm Size Monday Tuesday Wednesday Thursday Friday Saturday F 


Close All Firms 0.0544 —0.0406 0.2188 0.0604 0.1119 0.1489 30.67** 


to Smallest 0.1249 0.2040 0.2803 0.1157 0.1583 0.2059 51.73** 
close 2 0.0855 —0.0277 0.2325 0.574 0.1348 0.1674 34.15** 
3 0.0519 —0.0472 0.2114 0.0287 0.1297 0.1590 27.80** 
4 0.0500 —0.0491 0.1899 0.0321 0.1067 0.1459 24.10** 
Largest 0.0340 —0.0672 0.1591 0.0357 0.1053 0.1447  17.41** 

F's 10.20** 4.00** 83.32** 7.20** 31.23** 61.86** 

N 3225 3350 3350 3340 3360 2460 
Close All Firms 0.1766  —0.0483 0.1176 0.1510 0.1252 0.0532 116.95** 
to Smallest 0.2616 0.1030 0.1844 0.2084 0.1673 0.1381 191.77** 
open 2 0.2324 0.0805 0.1537 0.1942 0.1577 0.0961 174.54** 
3 0.2039 0.0679 0.1325 0.1639 0.1509 0.0813 144.77** 
4 0.1661 0.0393 0.0952 0.1207 0.1106 0.0409 87.02** 
Largest 0.0959 —0.0113 0.0425 0.0793 0.0707 0.0022 23.25** 

F's 234.77** 33.92** 127.10** 197.06** 102.22** 43.91** 

N 3225 3350 3350 3340 3360 2460 
Open All Firms —0.1234  —0.0889 0.1008 —0.0908 —0.0134 0.0952 30.25** 
to Smallest —0.1367 —0.0792 0.0952 —0.0929 —0.0093 0.0676 28.98** 
close 2 —0.1469 —0.1082 0.0783 —0.1368 —0.0230 0.0711 35.23** 
3 —0.1519 —0.1151 0.0785 —0.1352 —0.0214 0.0775 33.57** 
4 —0.1160  —0.0884 0.0944 —0.0888 —0.0040 0.1048 25.27** 
Largest —0.0621 —0.0559 0.1164 —0.0440 0.0346 0.1422 18.13** 

F's 50.21** 23.84** 25.11** 34.00** 1.50**  46.07** 

N 3225 3350 3350 3340 3360 2460 


Source: Kato (1988a). 
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and 13.8 and Figure 13.6. Among his findings: The strong days — Wednesday, 
Friday, and Saturday — have positive gains during the day which essentiallly nullify 
the losses on the weekdays — Monday, Tuesday, and Thursday. Only on the very 
strongest days — Wednesday and Saturday — are there positive returns during the 
day for all levels of capitalized stocks outside of January. 


Table 13.8: Mean Returns in Percent for Trading and Nontrading, Periods on the TOPIX, 
January 4, 1974—June 8, 1987. 


Firm Size Monday ‘Tuesday Wednesday Thursday Friday Saturday All Days F’s 
January Returns 

Open to close returns: 

Smallest 0.4450 0.2976 0.4416 0.3970 0.4198 0.1402 0.3801 25.88** 
2 0.3118 0.2056 0.3664 0.2870 0.3500 0.2138 0.2932 15.66** 
3 0.1972 0.1559 0.3066 0.2270 0.3252 0.1596 0.2326 9.85** 
4 0.1293 0.1113 0.2759 0.1518 0.2455 0.1392 0.1779 6.42** 
Largest 0.0592 —0.0225 0.2499 0.0415 0.2791 0.1321 0.1242 4.56** 
Close to close returns: 

Smallest 0.4003 0.2353 0.2929 0.2713 0.2827 0.1522 0.2781 48.58** 
2 0.3523 0.1687 0.2220 0.2365 0.2162 0.1396 0.2263 39.82** 
3 0.2829 0.1568 0.1685 0.1987 0.2097 0.1101 0.1914 26.98** 
4 0.2085 0.1284 0.1381 0.1565 0.1587 0.3417 0.1432 16.88** 
Largest 0.1281 0.0358 0.0879 0.0793 0.1529 0.0361 0.0893 5.96** 
Open to close returns: 

Smallest 0.0440 0.0618 0.1481 0.1250 0.1364 0.0875 0.1015 4.33** 
2 —0.0408 0.0367 0.1439 0.0500 0.1335 0.0738 0.0665 2.51** 
3 —0.0858 —0.0010 0.1377 0.0280 0.1152 0.0493 0.0410 1.99** 
4 —0.0794 —0.0172 0.1374 —0.0050 0.0867 0.0973 0.0345 2.08** 
Largest —0.0691 —0.057 0.1617 —0.0380 0.1259 0.0957 0.0348 2.65** 
All Months but january 

Close to close total returns: 

Smallest 0.0962 0.0005 0.2661 0.0914 0.1350 0.2030 0.1290 39.72** 
2 0.0652 —0.0477 0.2208 0.0377 0.1156 0.1634 0.0894  26.98** 
3 0.0389 —0.0646 0.2030 0.0116 0.1122 0.1589 0.0731 23.19** 
4 0.0429 —0.0629 0.1824 0.0218 0.0943 0.1465 0.0675 10.46** 
Largest 0.0317 —0.0711 0.1512 0.0352 0.0898 0.1457 0.0602 14.47** 
Close to open returns: 

Smallest 0.2492 0.0916 0.1749 0.2030 0.1570 0.1369 0.1697 158.46** 
2 0.2217 0.0729 0.1477 0.1906 0.1524 0.0924 0.1482 145.83** 
3 0.1968 0.0603 0.1293 0.1609 0.1457 0.0788 0.1304 123.10** 
4 0.1623 0.0316 0.0915 0.1176 0.1063 0.0408 0.0936 73.87** 
Largest 0.0930 —0.0153 0.0385 0.0793 0.0633 0.0007 0.0447 19.36** 
Open to close returns: 

Smallest  —0.1529 —0.0913 0.0906 —0.1117 —0.0223 0.0659 —0.0409 32.17** 
2 —0.1564 —0.1207 0.0725 —0.1529 —0.0370 0.0708 —0.0590 37.66** 
3 —0.1578 —0.1249 0.0733 —0.1493 —0.0335 0.0799 —0.0574 34.60** 
4 —0.1194 —0.0946 0.0906 —0.0960 —0.0120 0.1055 —0.0262 24.33** 
Largest —0.0615 —0.0558 0.1124 —0.0445 0.0265 0.1462 0.0153 -16.17** 


Source: Kato (1988a). 
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Figure 13.6: Mean Returns by the day of the Week, Close-to-Close, Close-to-Open, and Open- 
to-Close, TOPIX, 1974-1987. 


All the gains are at night and there are positive returns on every trading day. 
Moreover, the small cap stocks make their major gains then. Then in total we have 
the advantage of the small stocks over the big stocks day by day in the close-to-close 
returns on every day of the week regardless if the day is positive like Wednesday or 
Saturday or negative as on Tuesdays. 

There is little discussion here or anywhere else of convincing reasons why this 
anomalous day of the week as well as other security market regularities occur. 
Indeed, even in the books on anomalies such as Coulson (1987), Dimson (1988), 
Hirsch (1986), and Ziemba (1990), the authors are fairly silent on this point. 
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Connolly (1988a,b) argues with much data and strong statistical tools that the day 
of the week effect in the United States is not as clear cut as previously believed and 
may not have even existed recently in United States security markets. 

The main reason anomalies are anomalies is that we cannot explain them. 
Indeed, the various anomalies have pestered hardline financial economists to such 
an extent that they are now quite fashionable and such results can now be published 
and indeed are encouraged in the most prestigious journals in the field. The anoma- 
lies, of course, do seem to violate traditional equilibrium models of asset behavior 
over time. They do have causes which we do not understand. Some thoughts along 
these lines regarding day of the week effects appear as follows. Obviously the rig- 
orous testing of the validity and percent explained by the various hypotheses will 
entail considerable effort and statistical ingenuity. Initial research on this appear in 
Hiraki, Aggarwal, and Rao (1988) and Komatsu and Ziemba (1989). 


Some possible Reasons for Day of the week Anomalies: 


1. Negative Tuesdays with Saturday trading 


e Investors are more likely to sell if they have a weekend to think it over 

e Sales representatives have a more difficult time getting buy orders after 
Saturday, especially with a rise on Saturday 

e You must sell on Tuesday if you need money on Friday because of the three 
day business settlement rules since there are no Saturday settlements 

e There is a lag from New York’s Monday fall via the correlation effect 

e It takes time to get into the market again in the new week 


2. Negative Mondays after Saturdays closed 


It takes time to get into the market again in the new week 

Sales on Monday provide receipts before the end of the week 

Investors are more likely to sell if they have a weekend to think it over 
Dealers have a harder time to round up buy orders 

Individuals are net sellers especially of small capitalized stocks. Their sell 
orders accumulate over the weekend and outweigh the buy orders of brokers 
recommendations since brokers are not soliciting orders on the weekend. Hence, 
contrary to other days of the week, there is net selling on Monday; see Miller 
(1988). 


3. Positive Wednesdays 


e According to the settlement hypothesis, Wednesdays should be very strong 

e Some brokerage dealers buy on Wednesday then sell on Thursday. The set- 
tlement for both is on Monday. Since transaction costs are almost zero and 
Wednesdays are typically strong this can be a profitable strategy. This is the 
only day that brokers can use this strategy. 
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4. All Gains Overnight 


e Orders are collected after the close of trading for open by salesmen’s push with 
a bias towards buy orders 

e There is cross selling late in the day to recover funds to match purchases made 
on the open 

e Speculative purchases are made at the open expecting the market to rise then 
there is covering at the end of the day for these trades 


Ziemba (1989b) investigates seasonality effects in Japanese futures markets 
including days of the week effects using the SIMEX, Nikkei Dow 225 and Osaka 
Kubusaki 50 contracts for the period 1986-1988. 

The Spot effect of weak Mondays and Tuesdays and strong Wednesday to 
Saturday is continuing. Wednesdays are the strongest days followed by Thursdays 
and Saturdays in the spot market. The Simex futures in Singapore look just like 
the spot market in Tokyo except in the Wednesday—Thursday period. Possibly this 
market is following rather than leading, although other evidence suggests the lead- 
ing hypothesis. The futures seem to rise and fall with the spot. The Osaka 50 seems 
to anticipate the days of the week effect. Saturdays hardly rise at all in the futures 
markets despite the spot gains, thus anticipating Monday’s fall. Then these futures 
fall on Monday to anticipate a further fall on Tuesday. They then rise on Tuesday 
to anticipate the gains on Wednesday, Thursday, Friday, and Saturday. They are 
then flat until the fall the next Monday. Figures 13.7 and 13.8 on page 277 visually 
show the results. 
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Figure 13.7: Days of the Week Effects in the ND Spot Market and the Osaka 50 Spot Market in 
Tokyo and the Kabusaki 50 Futures in Osaka, June 6, 1987-Sept. 20, 1988. 
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Figure 13.8: Days of the Week Effects in the ND Spot Market in Tokyo and the SIMEX Futures 
Markets in Singapore, Sept. 3, 1986 to Sept. 19, 1988. 


Notes 


1. For a sampling of the U.S. literature on the day of the week effect, the reader 
may consult Connolly (1988ab), Dimson (1988), Flannery and Protopapadakis 
(1988), French (1980), Gibbons and Hess (1981), Harris (1986), Hirsch (1986), 
Jacobs and Levy (1988), Keim and Stambaugh (1984), Lakonishok and Levi 
(1982), Lakonishok and Smidt (1989), Mclnish and Wood (1985), Miller (1988), 
Rogalski (1984), Smirlock and Starks (1980), and Ziemba (1990). 

2. The law governing this was enacted in 1978 and is titled “maigetsu no daini 
doyoubt igai no doyoubi ni kansuru gyoumukitei, shinyoutorihiki oyobi taishaku 
torihiki kitet narabini jutakukeiyaku junsoku no tokurei.” As Ikeda had pointed 
out to us in a private communication, the discussion in Jaffe and Westerfield 
(1985a), p. 267 on this point is incorrect. 
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Chapter 14 


Comment on “Why a Weekend Effect?” 


William T. Ziemba* 


The weekend effect in U.S. security markets has been documented by French (1980), 
Gibbons and Hess (1981), and others. Miller (1988) argues that the effect could be 
explained by a tendency for self-initiated sell orders to exceed self-initiated buy 
orders over the weekend, while broker-initiated buy trades result in a surplus of 
buying during the remainder of the week. 

This causes security prices to fall over the weekend and during the day on Mon- 
day as market makers sell back stock on the open. Prices then move higher dur- 
ing the week because of broker-induced buying. Osborne (1962) presents a similar 
hypothesis, which also argues that institutional investors are less active on Mondays 
as effort is made that day to plan the week’s trades. 

The day-of-the-week variation is higher for small-capitalized than for large- 
capitalized firms because of the larger bid-ask spreads and the thin trading in these 
generally low-priced securities. Keim and Smirlock (1987) document this for U.S. 
markets, and Stoll and Whaley (1983) confirm the bid-ask spreads. 

Miller’s idea is predicated on the fact that people are simply too busy to think 
much about stocks during the week. If they do anything, it’s more often than not 
to buy upon the recommendation of a broker. Brokers have a vested interest in 
purchases. First, they do not have to find people who own stock and suggest they sell 
it. Second, they reveive two commissions for the purchases (usually recommended 
by the broker) and the sale (usually initiated by the stock owner). 

Groth, Lewellen, Schlarbaum, and Lease (1979) survey 6,000 broker recommen- 
dations. Eighty-seven percent represent purchase and only 13% sales recommenda- 
tions. Dimson and Marsh (1986) report similar recommendations by U.K. financial 
analysts. As individual investors think about their holdings over the weekend, they 
tend more to sell than to buy. Individuals, on balance, are net sellers of stock. 


He thanks Asaji Komatsu of the Yamaichi Research Institute in Tokyo for research assistance 
and helpful discussions. Comments on an earlier draft by Edward Miller improved the comment. 
Thanks also to the Yamaichi Research Institute for financial and other assistance in conducting 
this study. 

*Alumni Professor of Management Science at the University of British Columbia in Vancouver, 
B.C. (Canada, V6T IY8). 
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EXHIBIT 1: Mean Buy/Sell Ratios and the Excess Return on Small Stocks by Trading Day in 
January and the Rest of the Year, 1971-1985. 
Source: Ritter (1988). 
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Exhibit 1 from Ritter (1988) illustrates buy-sell ratios with data on individual 
orders at Merrill Lynch for January and the rest of the year between 1971 and 1985. 
There is also a strong turn-of-the-year effect for small stocks on trading days —1 to +4. 

Although Miller’s story is plausible, he did not test the theory with real data. 
Lakonishok and Maberly (1990) have provided such a test with New York Stock 
Exchange odd-lot sales and purchases, sales and purchases of cash-account cus- 
tomers of Merrill Lynch, and NYSE block transactions. They find that Monday 
has the lowest trading volume. Insititutional trading is the lowest on Monday of all 
trading days, but individual trading on Monday is the highest relative to other days 
of the week. 

Individuals also sell more on Mondays. For example, odd-lot sales minus odd-lot 
purchases relative to NYSE volume were 29% higher during 1962-1986 on Monday 
than for the average of Tuesday to Friday. 

Theoretical support for imbalances on different days in trading volume, mean 
returns, and volatility based on the interaction of various traders has been advanced 
by Admati and Pfleiderer (1988, 1989). 


Day-of-the-Week Effects 


Previous, studies of holiday effects in Japanese spot and futures markets reveal 
strong and significant positive preholiday effects and negative post-holiday effects 
over the 1949-1988 period (Ziemba [1989, 1991]). Hence it is appropriate to separate 
out the day-of-the-week effects. 

Additional research on day-of-the-week effects in Japan appears in Jaffe and 
Westerfield (1985), Kato (1990), Kato, Schwartz, and Ziemba (1990), and Ziemba 
and Schwartz (1991). Recent research on Japanese financial markets is surveyed in 
Ziemba, Bailey, and Hamao (1991). 

This article investigates the weekend hypothesis for the Japanese market using 
daily data from May 16, 1949, to December 28, 1988. The data are broken into 
475 ten-year subperiods beginning with May 1949 to April 1958 and ending with 
January 1979 to December 1988. Exhibit 2 shows that individual investors were net 
sellers in Japan as well as the U.S. during 1981-1989. 

For each of the 475 ten-year periods the equation 


6 
Ri = 9044 x day; + X 5; 
j=l j=1236 
x before; + 5 a; X after; (1) 
j=1236 
is estimated. 
The coefficients a; refer to the single trading days that are after aj = 1, j = 2, 

j = 3, and j = 6-day break from a trading holiday or weekend. Similarly, the bjs 
refer to the single trading day before a 1, 2,3, or 6-day break from trading. Hence 
Equation (1) gives as coefficients y; for the six days of the week the pure effects of 
these days separate from the holiday and weekend effects. 
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EXHIBIT 2: Net Purchases (Sales) of Stocks in Billions of Yen February by Various Investor 
Groups, 1981-1989. 


Individuals Financial Industrial Investment 
Total Cash Margin Institutions Corporations Trusts Foreigners 
1981 —425 na na 417 287 —95 110 
1982 —330 na na 422 82 17 181 
1983 —862 —1417 555 769 27 —69 726 
1984 —55 —1228 1173 1479 319 324 —1922 
1985 —942 —1586 643 1825 283 468 —869 
1986 —1762 —3109 1327 4568 754 1057 —3787 
1987 —1307 —3350 2043 6274 1150 1811 —7196 
1988 —3288 —5102 1814 4365 626 1730 77 


1989 113 —4485 4599 4683 —1150 —2192 3348 


Source: Yamaichi Research Institute. 


Exhibit 3 gives the mean returns by day of the week estimated from Equation (1) 
after adjusting for pre- and post-holiday effects by year from 1949 to 1988. Each 
ten-year period is plotted at its final month. For example, May 1949 to April 1958 
is ploted as April 1958. 

Hence there are 475 such points for Monday, Tuesday, to Saturday in the estimate 
of qı to yẹ in Equation (1) for that ten-year period. The 475 points represent 
partially overlapping periods over the more than thirty-nine years of data. 

Exhibit 4 shows the test of hypotheses by day of the week that the daily 
return is not zero for each of the months in the years of the sample May 1949 
to December 1988. Most of the time the daily return is not zero (i.e., it is above the 
line) at the 5% significance level. 


Testing Miller’s Hypothesis 


A test of a hypothesis along the lines of Miller’s is shown in Exhibit 5 with 
P-statistics in Exhibit 6. Here P refers to the probability of accepting a false hypoth- 
esis using a two-tail test. Miller’s argument implies that individual investors reach 
net sell decisions on each weekend day (when they are not being urged to buy 
by brokers). These net decisions to sell made on the weekend come to market on 
Mondays, tending to force the price down. 

This theory implies that Monday declines after two days free of broker calls 
should be greater than after one day. The Japanese data provide a chance to test 
this prediction because some weekends provide only one day free of broker’s calls, 
and other weekends two such days. 

Case A, refers to the one-day weekends following weeks with Saturday trading, 
and Ag refers to the two-day weekends. The coefficients 4, and âz are from Equa- 
tion (1) for each of the 475 ten-year overlapping periods. The hypothesis is that the 
average returns on the As days are lower than those on the A; days. 

Exhibit 5 confirms this, showing Monday’s average daily returns for the 475 
ten-year periods. P-values for the hypothesis that the return on Monday is not zero 
using a two-tailed test appear in Exhibit 6. 


Comment on “Why a Weekend Effect?” 347 


Daily Rate 
of Return 
% 

124 


1.04 
0.85 


Bu 
uu 
ou 
mere 
œ u- 
o u 
oo 
ro 
No 
wo 
OS 
ua 
ao 
so 
on 
oo 
o s4 
ae 
NN 
wns 
aN 
uns 
one 
ss 
œ NH 
one 
o oH 
= œ- 
N œ 
w œ- 
a o 
u œ- 
o co 
~o 
œ œ- 
© o 


Daily Rate 
of Return 
% 


1.24 
1.0 4 
0.8 4 
0.6 5 
0.4 
0.25 
0.0 
0.24 
0.4 
0.6 4 
0.85 


-1.0- 
-1.2 - 


-1.4 


Pu 
uu 
au 
wu 
ou 
wou 
coos 
eo 
NOJ 
Wa 
BO 
ua 
ao 
No 
aon 
oo 
ons 
RN 
Nw 
wns 
BN 
uns 
ans 
~~ 
on 
ons 
oo- 
Fo 
No 
w œ- 
Bod 
u œo- 
ao 
No 
œ œ- 
wood 


EXHIBIT 3: Day-of-the-Week Effects by Ten-year Period Ending in the Plotted Month for Peri- 
ods Ending in April 1958 to December 1988 with Holiday and Weekend Effects Separated Out. 
Source: Yamaichi Research Institute. 


One-Versus Two-Day Weekends 


Sales efforts are intensified before holidays and weekends, which results in high 
returns on the preholiday Fridays and Saturdays. Exhibit 7 shows that the daily 
rise before two- day holidays and weekends is larger than far one-day breaks from 
trading using the by and bp coefficients. Exhibit 8 gives the P-values for the sample 
period 1954-1989. 
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EXHIBIT 4: P-statistics for Hypothesis that Daily Return is not Equal to Zero (Two-Tailed 
Test) for Exhibit 3. 
Source: Yamaichi Research Institute. 


The results provide further evidence that Monday returns (based on the time 
between the close of the previous week’s trading and the end of Monday’s) do not 
relate to a time in the investment cycle that would predict high returns, but rather 
ro the buy-sell relationships of the market participants. 

The results are averages over the ten-year periods, but the periods are over- 
lapping. To avoid overlapping, Equation (1) is applied to the decades of the 1950s 
(including 1949), the 1960s, the 1970s, and the 1980s for four non-overlapping peri- 
ods. Exhibit 9 describes the results. 
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EXHIBIT 5: Mean Daily Rates of Return on Mondays, Net of Holiday and Weekend Effects, on 

the NSA, for One-(A;) and Two-(A2) Day Weekends, 1960-1989. 

Source: Yamaichi Research Institute. 
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EXHIBIT 6: P-values for the Hypothesis of Non-zero Returns on Monday Using a Two-tailed Test. 
Source: Yamaichi Research Institute. 


The next-to-last column gives the coefficient ba;—baz, corresponging to the 
hypothesis that the two-day breaks yield lower returns. This was indeed the case, 
and significantly so except during the 1950s. 

Similarly, the last column shows the coefficient bg;—bp2, corresponding to the 
hypothesis that the market return is higher before a two-day break in trading than 
a one-day break. The results indicate this was the case in the first three decades, 
and the coefficient is significantly positive. During the 1980s, this effect was slightly 
negative, but the coefficient is not significantly different from zero. 
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EXHIBIT 7: Mean Daily Rates of Return on Preholiday and Preweekend Trading Days for One- 
(Bı) and Two-(B2) Day Trading Breaks on the NSA, for the 475 Ten-year Periods, 1949-1988. 
Source: Yamaichi Research Institute. 


EXHIBIT 8: P-values for the Hypothesis that the Returns on the Preholiday and Preweekend 
are Not Zero With a Two-tail Test. 
Source: Yamaichi Research Institute. 


Weekly Pattern of Returns Greatly Affected by Saturday 
Trading 


An interesting aspect of the day-of-the-week effect is that Tuesdays tend to have 
negative returns following a one-day weekend, and Mondays decline after two-day 
weekends. Exhibit 10 shows this effect. The calculations are for data from April 
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EXHIBIT 9: 


bwed 


0.0935 
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bp2 bai 


0.2329 —0.0555 
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0.1673* —0.1576* 
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0.1004 0.1163 
1.253 1.1790 


* ** or *** indicate that the coefficient is not equal to zero at the 10%, 5%, or 1% level, respectively. 


Estimated Coefficient Values for Equation (1) for the Four Decade Periods. 
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EXHIBIT 10: 


Trading Ends 


This 
Week Week 
Ist, 5th Sat 
Ai 
4th Ag Sat 
2nd Bı Fri 
3rd B2 Fri 
All Weeks 


Sample Size 


Last 
Week 


Sat 


Fri 
Sat 
Fri 


Mon 


0.0198 


—0.1008 
0.1135 
—0.3489 
0.0039 
449 


Tues 


—0.1072 


—0.1102 
—0.0649 
0.1931 
—0.0902 
464 


Wed 


0.1790 


0.1074 
0.0980 
0.2479 
0.1449 
464 


Thur 


0.0581 


0.0306 
0.0933 
0.2609 
0.0648 
465 


Fri 


0.0605 


0.1582 
0.1341 
0.2193 
0.1049 
467 


Sat 


0.1678 


0.0793 


0.1397 
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Effects of Saturday Trading on the Topix, April 4, 1978, to June 18, 1987, Mean Daily Returns in Percent. 
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8.67** 


2.53* 
4.05** 
0.46 


Sample 
Size 


1406 
649 
539 

58 


2652 


*and **indicate significance at the 1% and 5% levels, respectively, for rejection of the hypothesis that the mean returns are equal across days of the week. 
Source: Kato, Schwartz, and Ziemba (1990) and Kato (1990). 
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1978 to June 1987 when there was Saturday trading for the first and fourth weeks 
(and the fifth if there was one). 

The effects of Saturday trading from 1949 to 1988 are very interesting. First, 
Saturdays were extremely positive. Second, Saturdays were even more positive if 
the previous month had Saturday trading during the first week of the month in the 
sample period. Third, Mondays were negative if the previous Saturday was not a 
trading day (the third and fourth weeks), and even more negative if the third had 
no Saturday trading. However, Mondays were positive if the previous Saturday had 
trading, especially so if the second week of the month had no Saturday trading. 

As of the beginning of February 1989, Saturday trading in Japan ceased. The 
preliminary evidence since that time for the two-day weekends is broadly consis- 
tent with U.S. data and the results presented here, namely, negative Mondays and 
positive Tuesdays. For example, for the four months February to May in 1989, 
there were negative Mondays, —0.14%, positive Tuesdays, +0.17%, strongly posi- 
tive Wednesdays, +0.38%, mildly negative Thurdays, —0.03%, and positive Fridays, 
+0.14%. 

These results are based on data before the stock market decline of 1990-1992. 
Empirical anomalies such as the weekend effect tend to persist during periods when 
cash flow and institutional and individual investor behavior and constraints are 
functioning in a relatively smooth fashion. During extreme market declines and 
volatility such as the 63% drop in the NSA from January 1990 to August 1992, 
these anomalies are not as strongly present as investor fear and uncertainty prevent 
the usual behavior. Stone and Ziemba (1993) and Ziemba (1993) discuss this. 
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TEPPO MARTIKAINEN, JUKKA PERTTUNEN AND WILLIAM T. ZIEMBA 


Chapter 15 


The Turn-of-the-Month Effect in the 
World’s Stock Markets, January 1988 - 


January 1990 


1. Introduction 


Numerous researchers have discovered systematic 
scasonalities in stock market returns. These effects 
have been related to the day-of-the-week, holiday, 
turn-of-the-year and turn-of-the-month (TOM), 
among others. For instance, the extensive day-of- 
the week literature indicates significantly negative 
returns on Mondays or in the first half of the 
week.|1] Moreover, strong gains have occurred on 
pre-holiday trading days and negative returns on the 
post-holiday trading days. January returns have 
been higher than in other months especially for 
small capitalized stocks. Many of these seasonali- 
ties have been observed to be global. ZIEMBA 
(1994) reviews world wide security market regula- 
rities. 

This paper focuses on the turn-of-the-month effect. 
ARIEL (1987) reported that the returns of U.S. 
stocks were larger in the turn-of-the-month than in 
the rest-of-the-month for the period of 1963-1981. 
The turn-of-the-month was defined as a period from 
the last trading day of the previous month (-1) and 
the first four trading days of the new month (+1 to 
+4). His results have been supported by e.g., KEIM/ 
SMIRLOCK (1987), LINN/LOCK WOOD (1988), 
LAKONISHOK/SMIDT (1988), CINAR/VU 
(1991), and HENSEL/SICK/ZIEMBA (1993a, 
1993b). Similar patterns seem to appear both in the 
U.S. stock and stock index futures markets (which 
seem to partially anticipate the effects). The turn-of- 


the-month effect has also been documented in Japa- 
nese stock markets by ZIEMBA (1991) for the 
period 1949-1988. However, his results indicated 
that the Japanese turn-of-the-month effect was on 
days from -5 to 2 because of differing cash flows. 
The TOM effect was also observable in the Japanese 
stock index futures markets as well, and partially 
anticipated the cash effect (see ZIEMBA (1989)). 
BARONE (1990) also documents a turn-of-the- 
month effect in Italy that is in the second half-of- 
the-month because of Italy’s special settlement 
procedures. Recently, ZIEMBA (1994) confirmed 
the existence of a significant turn-of-the-month 
effect in the U.K. cash and futures markets. More- 
over, CADSBY/RATNER (1991) report the turn- 
of-the-month effect to be significant in Canada, 
Australia, Germany and Switzerland. 

There are various alternative explanations for the 
TOM regularity. The U.S. economy has substantial 
payments to private investors of salaries and debt 
interest on the -1 day of the month (see OGDEN 
(1990)). For instance, most of the corporate and 
municipal debt is payable on the first or last days of 
the month. Also bad news such as that relating to 
earnings announcements is delayed and announced 
late in the month, while good news is released 
promptly at the beginning of the month (PENMAN 
(1987)). HENSEL/SICK/ZIEMBA (1993a) offer a 
more detailed discussion on the underlying reasons 
for this regularity. This paper provides new eviden- 
ce on the TOM effect in the world’s stock markets. 
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The analysis of this regularity has been limited to the 
major financial markets of the world. {n this paper, 
the behavior of stock returns at the TOM is investi- 
gated in 24 countries and 12 different regions of the 
world. The global investigation of the TOM regula- 
rity is interesting because the major explanations 
offered for this regularity are based on institutional 
factors which may differ between countries. Thus, 
there may well be significant differences across 
countries. Moreover, the use of new data and new 
markets reduces the potential data snooping bias 
related to these type of studies (see e.g. LO/MAC 
KINLAY (1990)). The global results are also inte- 
resting in the context of recent deregulation and 
integration of the world’s financial markets. The 
loosening capital controls and faster information 
processing of world-wide news have made interna- 
tional portfolio management increasingly important 
for investors. The results also offer new insights in 
the context of the behavior of smal! markets.{2] A 
drawback of our study is that particular countries 
may have turn-of-the-month effects but at different 
times such as those discussed above in Japan and 
Italy. A complete study will analyze the common 
and different aspects of this seasonal anomaly in 
various world wide markets. 

This paper is organized as follows. The next section 
describes the data used in the study. The third 
section offers the empirical results. Section four 
concludes. 


2. The Data 


The data used in this study consists of daily closing 
stock prices of the Financial Times-Actuaries World 
Indices (see GOLDMAN SACHS (1988)). The 
indices are calculated separately for 24 countries in 
local currencies, U.S. dollars and U.K. pound ster- 
ling. These indices provide as full a coverage as 
possible; usually at least 70% of the aggregate 
market value of all domestic exchange-listed com- 
panies’ shares of stock. Regarding the coverage of 
each market by number of quoted companies, the 
aim has been for at least 10% of the available 


universe with up to 30 per cent being the preferred 
coverage. Markets, companies and securities are 
only included where the direct holding of capital by 
foreign nationals is permissible. Firms with an 
aggregate market value of less than $100 million are 
mainly excluded (at the end of 1985, the start date). 
The FT-indices are also calculated for different 
regions. The regional indices are measured in a form 
that reflects weighted local currency returns. In the 
world index, the Japanese and U.S. markets have the 
largest weights, followed by Great Britain. 

All indices are value-weighted and adjusted for all 
capital changes, including new issues and divi- 
dends. The data covers the period January 6th, 1988 
to January 31st, 1990. Thus, we use solely the post- 
crash period of 1987. This is because it has been 
shown that the behavior of stock markets has sig- 
nificantly changed after the crash, and that the co- 
movements between national markets are consider- 
ably stronger than before (see e.g., HAMAO/MA- 
SULIS/NG (1991), and MALLIARIS/URRUTIA 
(1992)). The returns are measured as first differen- 
ces of logarithmic price indices. Assuming con- 
tinuous compounding, these are good approximati- 
ons for returns. All returns are measured in local 
currencies and multiplied by 100. 


3. Empirical Results 


Figure ł shows the average return pattern by trading 
day for the Financial Times-Actuaries World Index. 
As expected, the largest returns are observable on 
day -1, i.e. the last day of the trading month (com- 
pare e.g. ZIEMBA (1991), pp. 143-144, for Japane- 
se results). Also other days around the TOM have 
positive returns. This supports the assumption of the 
existence of the TOM regularity in the world’s stock 
markets. The weight of U.S. markets is considerable 
in the World Index. Thus, it is possible that the 
above result is mainly due to the U.S. TOM effect, 
rather than due to a global phenomenon. To investi- 
gate this further, the analysis was replicated for 35 
other indices. Some of those results are displayed in 
Figures 2-4. 
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Figure 1; Average daily returns at the TOM for the World Index 
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Figure 2: Average daily returns at the TOM for the USA 
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Figure 3: Average daily returns at the TOM for Europe 
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Figure 4: Average daily returns at the TOM for Japan 
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Figure 2 illustrates the variation of stock returns by 
trading day in U.S. markets. As expected, the mean 
returns from -1 to 3 are all positive. The returns on 
-1 day are large. This supports the earlier findings by 
HENSEL/SICK/ZIEMBA (1993a) on the signifi- 
cance of this day inthe U.S. markets. Figure 3 shows 
that there appears to be tendency towards the TOM 
effect in Europe as well. All returns between days - 
l and 4 have positive means. The results in Figure 
4 reveal that the TOM seems to be different in Japan. 
As suggested by ZIEMBA (1991), the TOM effect 
starts earlier in Japan [3] than in USA. This seems to 
be explained, for instance, by the fact that in Japan 
most salaries are paid prior to the end of the calendar 
month.|[4] 

Table 1 shows the average daily returns of 24 
individual stock markets and 12 different regions 
for trading days from -9 to 9. The average return for 
the World Index on day -1 is 0.41%, while the 
average returns are below 0.15% on all other days. 
Forthe U.S. the average - | day returns are very high, 
0.67%. World wide returns with the USA excluded 


had their highest return on the same day. This gives 
further support to the assumption that the TOM 
regularity is international rather than purely a U.S. 
phenomenon. Belgium, Italy [5], Japan, and The 
Netherlands had their highest return on the last 
trading day of the month. Only three markets had 
negative returns on day -1, namely New Zealand, 
Norway and South Africa. South Africa had consis- 
tently positive returns on days | to 5, Norway on 
days 1 to 4, and New Zealand on days | to 3. The 
TOM effect thus appeared later in these markets. 
The t-values for the null hypothesis that the returns 
do not differ from zero on different trading days of 
the month are in Table 2. For the World Index, the 
returns on days -} and 2 are significantly different 
from zero at the 0.05 level, while on other days they 
do not differ from zero. For most regional indices 
(exceptions Scandinavia and Pacific Ex. Japan), the 
returns on -1 day are significantly positive. Regar- 
ding individual countries, a similar pattern applies 
most clearly in U.S. markets. 


Table 1: Average daily percentage returns of various world wide stock markets and indices for trading days -9 to 9 


-9 -8 -7 -6 5 -4 -3 -2 -1 +i +2 +3 +4 +5 +6 +7 +8 +9 

Australia 029 O19 0.14 003 -004 -048 0o04 -0.06 0.23 0.01 o1 0.25 0.08 -0.04 -001 0.16 0.28 0.00 
Austria 0.00 -0.01 0.05 0.29 0.19 0.02 O15 0.19 LAES O3 053 0.31 0.10 0.01 0.34 0.35 0.29 0.19 
Belgium -0.04 012 -008 -U01 007 0.03 -012 -007 0.30 oog 02t 0.32 0.30 ou 0.02 Ott on -0.13 
Canada 0.07 -011 -0.05 006 -01 0.02 0.04 0.07 0.33 On 0.10 0.00 0.09 0.09 0.02 “ONL 0.10 -0.12 
Denmark O7 015 O15 040 092 -0.08 O12 024 0.21 03 0.33 0.29 0.32 0.03 0.02 (1.23 0.04 0.01 
Finland 0.15 013 0.06 -0.12 -0.09 -0.00 -0.09 0.12 0.23 0.00 O14 0.14 0.21 0.37 -0,02 -0.05 0.04 0.12 
France 022 006 024 058 O7 -0.14 0.1? 0.06 0.25 0.26 0.44 0,29 ooa 0.14 013 0.14 -0.18 0.0) 
Germany 0.43 0.06 001 0.05 0.06 -0.16 0.26 0.04 OLE 0.59 0.54 0.19 0.28 0.19 019 -0.04 0.16 0.03 
Hong Kong -0.35 -0.56 0.14 0.03 -0.06 -0.37 0.16 -0.03 0.32 -0.04 0.39 4.49 0.22 0.50 0.24 0.69 0.09 -0.06 
Ireland 04) 0.04 010 0.01 0.3) -0.09 0.12 0.19 0.05 -012 O18 kit 0.26 0.04 0.22 0.09 -0.24 0.02 
Italy -004 -008 -0.07 U04 -014 -001 -0.12 -0.04 0.32 0.14 -0.03 OUT 0.01 0.03 0.18 O18 0.03 0.04 
Japan -0.15 0.10 O11 -QIR 032 0.34 0.24 0.13 0.35 -0.10 0.12 0.01 0.06 0.03 0.03 0.27 ONT 0.03 
Malaysia -0.16 020 022 016 0.16 -003 0.0419 0.15 009 0.17 0.22 0.08 0.06 0.40 0.39 0.15 0.12 
Mexico O7 058 -008 045 050 O15 015 0.02 0.01 0.37 0.15 0.09 0.43 0.49 0.52 0.03 0.07 -0.38 
Netherlands 0.26 000 -DÛR -NOR 002 -0.08 OIR -0.06 0.22 017 O17 01.22 0.19 O18 0.16 0.06 0.38 0.14 
New Zealand } 0.02 -013 -01L 018 -0,24 0.71 0.27 0.08 O.1t 006 0.35 042 0.08 0.23 0.13 0.34 0.07 0,33 
Norway 0.48 043 -000 6.08 -00 -0.18 014 -0.09 -000 034 0.48 O18 0.22 0.03 019 0.46 KATI -0.20 
Singapore -036 -011 014 -013 003 -0.13 00l 0.27 0.1 aos 0.17 0.28 0.16 0.06 0.10 ost 0.21 0.08 
South Africa } 014 -009 026 025 019 -0.16 0.16 003 -0.06 028 033 0.52 0.67 0.83 -0.03 0.12 0,25 0.26 
Spain -015 -0.05 -0.05 -007 Olé -0.03 004 -0.14 0.24 0.26 O15 0.03 0.04 0.16 0.02 CAR 6.03 0.10 
Sweden -0.20 0417 -0.03 024 024 -0.05 027 02 0.19 012 024 0.37 0.46 0.04 0.27 0.19 0.18 0.00 
Switzerland -0.21 6.05 042 00l 004 -024 on O13 9.20 0.25 040 0.38 0.22 0.18 0.04 0.20 0.34 0.20 
UK 0.29 -013 000 005 -020 -00I 0.34 -0.09 0.21 012 O15 0.20 0.12 0.07 oDi «0.05 0.15 0.06 
USA -0.08 0.14 007 036 -010 0.00 -004 01$ 0.67 O18 0.06 0.15 0.08 -0.02 015S -0.17 -0.07 0.22 
Europe 017 -004 002 0.04 -005 -007 0.22 -0.04 0.20 021 0.25 O18 0.15 0.10 0.05 0.03 0.14 ooi 
Scandinavia f -0.09 019 -015 016 029 -0.04 0.32 012 0.08 038 0.38 0.37 0.31 0.48 0.36 0.19 0.19 0.13 
Pacific Basin | -0.14 009 012 -017 030 031 0.23 0.12 0.34 010 013 0.00 0.05 0.03 0.03 0.28 -0.16 0.03 
Evro-Pacific | -0.04 0.05 0.08 -011 019 0.19 0.23 0.07 0.30 0.06 0.23 0.06 0.01 0.06 0.04 0.20 046 0.02 
North America | -0.07 -013 0.06 034 -007 -003 -003 014 0.65 0.17 0.06 O14 0.07 -0.02 014-016 -008 -0.21 
Europe Ex, UK | 0.08 6.03 0.03 013 0.07 0.11 0.13 -0.02 0.21 0.30 0.32 0.18 0.17 0.13 0.08 0.09 D13 -0.04 
Pac. Ex.Jap. | 0.03 -0.09 013 G03 -007 -0.26 o -0M2 0.21 owm 0.21 9.04 0.05 0.09 0.09 0.36 KETI 0.02 
World Ex.US | -0.04 0.04 0.08 -010 0.18 0.18 022 0.07 0.29 o0 017 0.06 0.02 0.06 0.04 0.19 0.18 0.02 
World Ex. UK | -0.09 -0.00 0.09 0.06 012 0.14 012 OIL 0.44 005 Oig 0.07 0.03 0.03 0.08 0.09 0.13 -0.07 
World Ex. SAF. | -0.05 -0.02 0.08 0.05 0.09 0.13 014 009 0.42 006 O54 0.08 0.02 0.03 0.08 0.08 0.13 0.06 
World Ex. Jap. | 0.02 -0.09 0.06 022 DOR -003 0o 0m 046 QIR Qis O15 0.02 0.04 GANI -0.07 0.10 0.12 
The World ind. | -0.05 -0.02 0.08 0.05 009 0.13 o4 009 o4 006 0.14 0.08 0.01 0.03 0.08 0.08 0.13 0.06 
Period: January 1988 - January 1990. 
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Table 2: T-values for the hypothesis H: R = 0 for Table 1 


T -9 -8 -7 6 -5 4 -3 -2 -i + +2 +3 + +5 +6 +7 +8 +9 
Australia 1.50 0.85 0.24 -0.18 171 -0.32 -0.28 -0.06 0.81 -1.97 0.03 
Austria 0.00 0.31 1.48 149 21s” 0.42 0.04 2.04 2.26" 1.70 0.76 
Belgium -0.21 -0.67 -0.06 0.86 2.63" 2.20° 0.74 0.10 1.00 1.06 -0.90 
Canada 0.96 -0.49 0.46 -L41 -0.04 0.78 0.59 0.13 -0.96 -0.73 -1.28 
Denmark 1.46 1.29 1.05 1.36 207 2.58° 0.27 0.15 1.37 0.23 0.12 
Finland 1.05 0.33 -0.87 -0.83 -0.69 140 2338" -0.15 -0.29 0.23 0.84 
France 1.21 1.57 3.79 0.77 1.86 0.27 0.98 0.85 0.70 -1.08 0.04 
Germany 0.59 0.04 6.29 031 1.06 2.02 Wut 0.96 -0.23 -0.81 a3 
Hong Kong -1.44 0.35 0.14 -0.13 -0.44 ORL 1.75 0.92 30r 0.25 -0.29 
Ireland 2.16" 0.67 0.04 1.42 0.60 PAS 0.20 0,76 0.64 L51 08 
Italy 0.17 -0.45 -0.27 -0.72 -0.93 0.06 -0.20 -1.04 1.16 0.35 0.23 
Japan -1.10 0.99 -1.56 1.67 -0.05 -051 0.30 O18 -I8 0.21 
Malaysia -0.80 1.39 0.78 0.90 0.99 0.39 0.18 1.42 0.78 0.78 
Mexico. 219° -0.20 0.93 1.07 O18 1.70 1.77 L7 0.19 -1.26 
Netherlands 1.44 -0.58 -0.52 O11 1.45 1.39 1.27 0.6% 2.38" -0.68 
New Zealand 0.10 -0.54 0.72 -1.02 1.34 -0.31 -1.29 0.53 -0.46 -220W 
Norway 1.69 -0.02 0.28 -0.07 1.19 1.09 -0.17 -0.65 191 -0.29 -0.90 
Singapore -0.90 0.81 -0.73 017 1.17 093 0.41 0.32 284 0.73 0.58 
South Africa 0.58 0.90 114 0.97 209 3.78" 1.70 -0.12 0.64 -4.20 -1.20 
Spain -1.87 -0.47 -0.60 4.74 0.22 0.32 LA O11 110 0.28 -0.75 
Sweden -1.15 -0.16 1.48 tl 2.59° 2.46" 0.27 136 1.49 0.89 0.00 
Switzerland 0,7) 1.03 0.04 0.18 270 1.314 1.05 -0.16 1.42 2.27" -1.28 
UK 2.07" -t.44 1.66 0.79 0.50 0.07 -0.33 -1.05 0.40 
USA -0.50 0.64 LOI -0.50 -0.07 1.02 -1.60 -0.40 -1.46 
Europe 1.47 -0.43 241 164 1.12, 0.38 0.31 A2 0.07 
Scandinavia -0.56 1.79 219° 2.09" +102 239° 1.43 1.75 0.99 
Pacific Basin -1.07 1.65 -0.03 -0.46 0.34 0.21 2.67" -1.22 0.20 
Euro-Pacific -0.56 1.44 067 O11 0.69 0.30 2.18" -1.35 0.22 
North America | -0.46 -0.45 0.97 -0.46 -0.05 1.01 -1.59 -0.42 -1.46 
Europe Ex. UK | 0.63 0.49 216 2.02 1.41 0.53 0.86 At -0.28 
Pac. Ex. Jap. 0.21 -0.32 0.13 0.33 0.72 0.50 2.18" -06t -0.16 
World Ex.US | -0.35 1.40 0.72 0.21 0.76 0.30 214" A395 0.16 
World Ex.UK | -0.93 ki . 0.89 08A -0.26 0.19 0.89 1.32 -4.03 -0.75 
World Ex, SAf. | -0.57 1.03 0.41 0.71 0.97 -0.19 0.21 0.82 1.05 -1.07 -0.62 
World Ex. Jap. | 0.19 0.63 1.54 -0.67 148 O18 O19 1.09 -0.79 0.73 -1.13 
The World Ind. | -0.57 1.06 0.42 0.72 1.00 0.14 0.23 0.83 1.07 -LOR -0.64 
* significant at 0.5 level. 
Period: January 1988 - January 1990. 
Table 3: T-values for the hypothesis H,: R = R for Table 1 

9 a 7 -6 -5 -4 -3 -2 -l +l +2 +3 +4 +5 +6 47 +8 +9 

Australia 0.50 0.24 -0.46 -1.09 -0.07 -0.76 0.93 -0.21 0.25 132 -0.28 
Austria ©.71 0.65 0.23 -0.86 OAT O12 0.12 ORR 1.23 4.03 on 
Belgium 1.25 OSE 0.03 -0.70 -2.88 -4.13 0.81 0.04 1.21 2.02 -1.39 
Canada -0.69 0.30 -4.66 0.02 0.23 0.76 32x 1.00 0.85 -0.22 “1.50 
Denmark -0.02 0.65 0.37 -1.50 4.36 0.46 0.30 -0.14 1.02 0.99 -1.08 
Finland 0.04 -4.27 -4.35 -0.40 -0.62 0.43 t09 -0.29 0.74 -0.95 0.45 
France 0.87 3.08" 0.29 -1,26 0.43 -0.94 0.83 FBT 2.18" las -0.41 
Germany 0.68 0.29 -0.24 -1.36 O71 -0.63 0.01 248° 1.84 0.47 -0.39 
Hong Kong 0.26 -0.04 -0.21 -2.50° 0.63 -0.30 1.32 OAL BRI -0.47 -0.47 
Ireland OMR -0.44 0.90 -1.42 0.02 0.33 -0.50 1.01 O31 -0.04 -0.65 
haly -0.79 0.64 -1.03 -0.37 1.01 0.79 1.88 047 -0.41 -1.25 -0.11 
Japan 0.36 28` 1.29 1.57 1.05 0.40 243° 145 0.48 -0.72 -0.31 
Malaysia 0.54 O10 0.13 -0.89 -1.09 0.36 0.13 -0.33 O18 0.38 -0.08 
Mexico, -0.84 0.38 0.51 -1.61 -0.43 0.92 -0.53 0.24 -1.07 -0.34 -218 
Netherlands -1.03 -0.94 -0.21 -0.84 0.73 -0.70 1.04 0.50 0.70 1.04 -0.99 
New Zealand -0.53 0.73 - £00 2,23° 1.12 0.39 0.62 0.35 1.31 125 -2148 
Norway -0.60 -0.19 -0.70 -1.45 0.03 -0.85 -0.52 0.85 1.51 0.28 -1.50 
Singapore 0.30 1.23 -0.37 -1.07 -0.43 1.07 0.15 -0.21 0.35 0.80 -0.07 
South Africa 0.53 0.64 041 -1.14 02t 0.41 -0.65 O88 Lid 1.65 -t69 
Spain 0.71 0.80 1.47 0.49 0.24 L71 1.69 1.63 0.94 -0.40 -0.94 
Sweden -0.86 0.67 0.49 -1.06 0.89 0.53 0.38 -0.04 O71 1.66 -0.81 
Switzerland 0,63 -0.22 -0.41 118 0.44 049 046 1.27 1.95 2.38° -1.57 
UK -0.43 -0.55 -1.78 -0.33 1.90 -0.74 1.05 045 08l 1.26 0,08 
USA 0.22 1.66 -0.91 -0.30 0.49 0.65 3.39" 0.95 Oii 0.72 +75 
Europe -0.55 -0.19 -0.96 -4.08 1.45 -0.87 1.38 1.21 1.93 1.51 -0.48 
Scandinavia -1.94 0.13 1.07 -0.95 1.62 0.00 -0.32 0.66 1.86 1.51 0.12 
Pacific Basin 0.41 -2.16" 1.25 1.44 1.03 0,36 2.45° -1.14 0.59 0.69 -0.33 
Euro-Pacific 0.17 -1.68 0.90 0.94 1.26 0,01 2.54° -0.97 1.59 -0.16 -0.45 
Nonh America 018 1.6) -0.72 -0.52 -0.46 0.66 3.42" 0.96 0.15, 0.69 -1.75 
Europe Ex. UK | -0.05 -0.55 0.38 -0.14 -1.32 0.40 0.84 1.43 1.86 2.23" 1.13 0.91 
Pac. Ex. Jap. -0.10 0.46 -0.17 -0.53 -t.82 0.29 -0.49 1149 -031 0.95 0.01 -0.50 
World Ex. US | -0.99 0.16 1.64 0.86 0.92 1.27 0.01 2.57 0.50 1.35 -0.13 -0.53 
World Ex. UK | -1.59 0.34 -0.03 0.42 075 0.59 0.46 3.37° -0.08 1.22 0.13 -1.42 
World Ex. SAF. | -t.21 0.26 -0.12 0.23 0.67 0.80 0.32 3.347 -0.02 1.18 0.26 +128 
World Ex. Jap. | -0.29 0.05 1.17 -1.13 0.84 0.13 0.15 ar 1.10 0.90 0.98 -1.60 
The World Ind, | -1.22 0.27 0.11 0.23 0.65 0.82 0.30 3.32" -0.01 1.21 0.28 -1.30 


* significant at 0.5 level. 
Period: January 19R8 - January 1990. 
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Table 3 provides the t-values for testing the null 
hypothesis that the average returns by trading day of 
the month do not differ from the average returns 
across all trading days. The results indicate that 
there is significant variation in returns by trading 
day of the month. For most regional indices, inclu- 
ding, for instance, the World Index, Pacific Basin 
and North America the returns on the last trading 
day of the month are significantly higher than on 
other days. 

Table 4 provides statistical tests for the TOM phe- 
nomenon. Average daily returns are computed for 
four periods, i.e. the turn-of-the-month (trading 
days from -1 to 4), the first-half-of-the-month (from 


-1 to 9), the rest-of-the-month (from 10 to -2), and 
the whole period. The average values of these 
returns appear in the first four columns of the table. 
They indicate consistently positive returns for all 
markets and periods. Mexico, Austria, and Den- 
mark had the highest average returns during the 
period. 

The remaining columns of Table 4 show the t- 
statistics for the null hypotheses that the average 
returns by periods do not differ from zero or the 
average returns through the research period. The 
tests of the latter hypothesis indicate that from the 36 
indices, the TOM effect between trading days - ł and 
4 was statistically significant in 12 cases at the 0.05 


Table 4: Average daily percentage returns and t-values during the different periods of month for various world wide 


stock markets and indices 


Average daily returns t-values for Hy; R =0 values for Hy: R = R 
-lto+4 -]to+9 410t0-2 Alldays Flio+4 -lto+9 +l0to-2 Alldays |- to+4 -lto+9 +l010-2 Alldays 
+ 
Australia 0.105 0.035 0,074 0.056 1.349 0.625 1.197 1.332 0.632 -0.387 0.296 -0.000 
Austria 0.306 0.271 0.068 06l 2925" IY 1.228 3.592’ 1.382 1.505 + 1.698 -0.000 
Belgium 0.240 0.140 0.014 0.072 1295" RSF 0.363 2.326° 2.300 1.379 -1.476 0.000 
Canada 0.126 0.041 0.003 0.020 2.747 1.091 0.081 0.814 2.308" 0.548 -0.525 0.000 
Denmark 0.256 0.160 4.142 0.150 3.770" 3.4017 2.575" 4.089" 1.553 0.205 -0.150 -0.000 
Finland 0.091 0.091 0.022 0.054 1,223 1.828 0.429 1512 0.503 0.752 0.640 0,000 
France 0.257 0.151 0.073 0.109 3,635" 2.770" 1.290 2.757" | 2.093" 0.767 -0.632 -0,000 
Germany 0.333 0.186 0.038 0.106 4.276 3.202 0.478 2.112" 2.915° 1.374 -0.868 -0.000 
Hong Kong 0.083 0.179 -0.090 0.035 0.352 1.368 -1.003 0.446 0.206 1.105 -1.389 0.000 
Ireland 0.097 0.064 0.154 0.112 1.226 L118 2.183" 2.431 | -0.195  -0.844 0,588 0.000. 
Italy 0.057 0.032 0.079 0.057 0.724 0.607 1.281 1.394 0.002 -0.484 0,353 -0.000 
Japan 0.062 0.050 0.092 0.073 1.131 1.256 2.154 2.468" | -O.198 -0.567 0.454 0.000 
Malaysia 014I 0.180 0.100 0.137 1.786 2.974" 1.401 2.8867 0.057 0.716 -0.523 -0.000 
Mexico 0.070 0.192 0.329 0.266 0.370 1.578 2.834" 3.1647 | -1.043  -0.607 0.546 -0.000 
Netherlands 0.195 0.087 0.043 0.063 2.745" 1.602 0.775 1.622 1.854 0.433 -0.362 -0.000 
New Zealand 0.123 0.045 -0,044 -0.003 LLIS 0.619 -0.541 -0.058 1.144 1.663 -0.502 0.000 
Norway 0.241 0.113 0.154 0.135 2.456" 1.434 L8I8 2.317" 1.084 -().282 0,225 -0.000 
Singapore 0.154 0.174 0.017 0.090 1.788 2.602 0.253 1.865 0.747 1.262 -1.061 -0.000 
South Africa 0.348 0.183 0.044 0.108 3.497" 248X 0.566 2.007" 2.413" 1.016 -0.830 0.000 
Spain 0.133 0.088 -0.030 0.024 2.2217 2.146 -0.746 0.828 1.819 1:557 -1.335 0.000 
Sweden 0.280 0.208 0.068 0.133 3461 3.732" 1.154 3.236" 1.819 1.349 -1.089 0.000 
Switzerland 0.268 0.109 -0.010 0.045 3.873" 2.002" -0.150 0.992 3.228" 1.180 -0.794 -0.000 
UK 0.160 0.074 0.026 0.048 2.536 1.544 0.563 1.442 1.774 0.535 -0.469 0.000 
USA 0.197 0.064 0.026 0.043 2.689" LIS 0.466 1.101 2.096" 0.367 -0.324 -0.000 
Europe 0.198 0.103 0.036 0.067 4.6637 3.044" 0.894 2.497" 3.077" 1.062 -0.761 -0.000 
Scandinavia 0.301 0.190 0.050 0.115 3.254 2.616 0.664 2.177" 2.008" 1.027 -0.853 -0.000 
Pacific Basin 0.065 0.053 0.087 0.071 1.221 1.361 0.975 L388 -0.125 0.485 0.179 -0.000 
Euro-Pacific 0.107 0.069 0.071 0.070 2.319  2.086° 202r 2.887" 0.801 -0.020 0.017 0.000 
North Amcrica 0.193 0.063 0.024 0.042 2.720" 1.149 0.454 1.100 2.128" 0.380 -0.334 -0.000 
Europe Ex. UK 0.234 0.130 0.045 0.084 5.514 3.669 0.952 2.788" | 3.537 1.294 -0.836 -0.000 
Pac. Ex. Jap. 0.103 0.093 0.011 0.049 1.121 1.598 0.189 1.192 0.587 0.760 -0.661 0.000 
World Ex. US 0.110 0.070 0.068 0.069 2.624 2.213 2.025" 2.967 0.975 0.023 -0.018 -0.000 
World Ex. UK 0.136 0.068 0.057 0.062 3.046 2.080" 1.806 2.737 1.656 0.177 -0.156 0.000 
World Ex. SAf. 0.136 0.067 0.054 0.060 3.112° 2.098" 1.756 2.709" 1.735 0.216 -0.192 -0.000 
World Ex. Jap. 0.192 0.080 0.028 0.052 3.733 2.056" 0.747 1.921 2.717 0.717 -0.634 -0.000 
The World Ind. 0.138 0.068 0.054 0.061 3.164 2.137 1.761 2.735* |. 1.770 0.232 -0.206 0.000 
* significant at 0.5 level. 
Period: January 1988 - January (990, 
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level. Regarding regional indices, the TOM effect 
was significant in Europe, Scandinavia, North 
America, Europe Ex. U.K and World Ex. Japan. 
Thus, the World Index does not reflect any positive 
TOM in this case. This is apparently due to the fact 
that the TOM regularity was observed to occur 
earlier in Japan than in other countries. When Japan 
is excluded from the World Index, the TOM regu- 
larity is significant. The first-half-of-the-month was 
always statistically insignificant. 

Regarding individual countries, significant TOM 
returns are observed in Belgium, Canada, France, 
Germany, South Africa, Switzerland and U.S.A., 
when the hypothesis that the returns do not differ 
from the average returns across all trading days is 
tested. There are alternative explanations why the 
TOM is not statistically significant in other coun- 
tries. One reason may be that for some markets the 
FT-indices are rather thin and do not cover the 
market behavior of these countries well enough. 


4. Conclusions and Further Research 


This paper provides new evidence on the turn-of- 
the-month effect in the world’s stock markets. In 
earlier research, the analysis of this regularity has 
been limited to the major financial markets of the 
world. We investigate the phenomenon in 24 indi- 
vidual countries and 12 different regions of the 
world. This global investigation is interesting be- 
cause the major explanations offered for this regu- 
larity are based on institutional factors. Moreover, 
the use of new data sets reduces the potential data 
snooping bias related to these type of studies. The 
turn-of-the-month effect was found for several coun- 
tries, as well as for most regions of the world. The 
effect was strongest in the U.S. markets. Regarding 
individual days, the returns of -1 day were higher 
than the returns of other TOM days. 

Due to the short estimation period used, the results 
of this paper should be replicated with a much larger 
data set. This data would also enable investigation 
of the association between the TOM regularity and 
other seasonalities, such as the January effect, in 


various countries as well as differing institutional 
and investor practices and constraints. In addition, 
it would be interesting to study the relationship 
between firm size, share price and the TOM regula- 
rity in different countries. Finally, the lead-lag 
relationships between markets with respect to the 
TOM effect might offer an interesting avenue for 
further research. 


Footnotes 


{1] In most markets, the Monday effect is observable. In 
some smaller markets, however, the Tuesday effect 
seems to be dominant (see MARTIKAINEN/PUTTO- 
NEN (1993) for a review). The day-of-the-week effect 
is sensitive to the research period used and possibly has 
been absent in U.S. markets in the late 1980’s and early 
1990's. 

[2] THEOBALD/PRICE (1984) discuss the problems of 
seasonality estimation in thin markets. 

[3] In 1990-1992 there was no turn-of-the-month effect in 
Japan (see ZIEMBA (1994)). The reason for this may be 
that with an extreme declining market individual inves- 
tors were not interested in stock market investment at 
the times they had cash flows. 

[4] For instance, most salaries are paid on days 20 to 25 of 
the month with the 25th being especially popular. 
ZIEMBA/SCHWARTZ (1992) discuss numerous other 
reasons as well. The include that there is portfolio 
window dressing on day -1, there is buying on day -3 
because of the investment strategies of security firms, 
and large brokerage firms have a sales push that lasts 7 
to 10 days starting around day -5. 

[5] Table {’s results are also consistent with BARONE’s 
(1990) 1975-1989 Italian data set. The -1 day has high 
mean return; but the TOM effect occurs in the second 
half of the month because that is how the settlement 
procedures in Italy define it. Their month essentially 
begins on the (5th and ends on the 14th with all orders 
up to the 15th settled on the -1 day. Hence there tends 
to be buying immediately after the 15th and selling 
immediately after the -1 day. 
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The mean return for small and large capitalized stocks in the cash and futures 
markets was positive in the first half of the month and negative in the second half 
of the month during the 10-year period of futures trading from May 1982—April 
1992. The mean return in the cash and futures markets for small and large 
capitalized stocks at the turn-of-the-month five-day trading period was significantly 
greater than average. There was partial anticipation of the cash turn-of-the-month 
effect in the futures markets on the previous three trading days. There was 
seasonality in the monthly return patterns, with the first and last quarter exhibiting 
higher returns at the turn-of-the-month and in the first half of the month. These 
results are an out-of-sample confirmation of the turn-of-the-month anomaly 
Ariel(1987) reported for the cash market in the earlier period 1963-1981. The 
anomaly appears in the cash and futures markets, ruling out many explanations of 
the cash market anomaly that are based on trading frictions. 
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The Turn-of-the-Month Effect in the US Stock Index Futures Markets 


1. Introduction 


There is considerable belief in the investment profession that there is a 
turn-of-the-month effect in US and other security markets. Market advisors such 
as Merrill (1966), Fosback (1976), and Hirsch (1986) have argued that stocks 
advance at the turn-of-the-month. The five days consisting of the last trading day 
of the previous month (-1) and the first four trading days of the new month (+1 to 
+4) are referred to as the turn of the month. Roll (1983) has documented that 
there were significant positive returns during the years 1963 to 1978 in January at 
the turn of the month and, in particular, that the small-capitalized stocks 
significantly outperformed the large capitalized stocks on these days. 

Ariel (1987) documented the turn-of-the-month and first-half-of-the-month 
effects (trading days -1 to +9) for small and large capitalized US-traded stocks in 
the 19 years from 1963-1981, using equal- and value-weighted indexes of all 
NYSE stocks. His research showed there were very high returns at the turn of the 
month. The rest of the market gains during 1963-81 occurred in the second week 
of the month. The first half of the month had all the gains, and the second half of 
the month had negative returns. 

Lakonishok and Smidt (1988) reported that over the 90-year period, 1897— 
1986, the large-capitalized Dow Jones Industrial Index rose 0.475% during the 
four day period -1 to +3 each month, whereas the average gain for a four-day 
period is 0.061%. The average gain per month over these 90 years was 0.349%. 
Hence, aside from these four days at the turn of the month, the DJIA had negative 
returns. 

The reasons for the turn-of-the-month effect are several, but they are 
largely cash flow and institutionally based. For example, the US economy often 
has cash payments to private investors of salaries and debt interest on the -1 day 
of the month. In addition, there are institutional corporate and pension fund 
purchases at the turn of the month. These cash flows vary by month and lead to 
higher average returns in January, which has the highest cash flow. Odgen 
(1990) presents some empirical support of this hypothesis and related monetary 
actions for US markets. For example, 70% (90%) of the interest and principal 
payments on corporate (municipal) debt are payable on the first or last days of the 
month. Moreover, 45% (65%) of dividends on common (preferred) stock are 
payable on these dates. Another factor in this effect seems to be behavioral. 
One manifestation is that bad news such as that relating to earnings 
announcements is delayed and announced late in the month, while good news is 
released promptly at the beginning of the month [see Penman (1987]. 

Evidence from Japan by Ziemba (1989,1991) and Canada, the United 
Kingdom, Australia, Switzerland, and West Germany by Cadsby and Ratner 
(1991) provides further support for the cash flow hypothesis. In Japan most 
salaries are paid from the 20th to the 25th of the month, with the majority on the 
25th. Security firms can invest for their own accounts based on their 
capitalization, which results in buying commencing on the -3 day with their three- 
day settlement. Individual investors buy mutual funds with their pay, which they 
receive on calendar days 15 to 25 of the month; the funds are then invested in 
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stocks with a lag. Employee stock-holding plans and mutual funds also receive 
money in this period to invest during the turn of the month. 

Using data from 1949 to 1988, Ziemba (1991) found that the turn of the 
month in Japan is similar to that in the US. However, the dates change, with the 
turn of the month being trading days -5 to +2 with +3 to +7 the rest of the first half 
of the month. Ziemba also found that each of the days -5 to +2 had significantly 
positive returns. Moreover, all the gains in the stock market were in the first half 
of the month. The securities firms have some information regarding these trading 
patterns and plan their monthly sales efforts during the -5 to +7 period. 

Ziemba (1989) investigated the futures market trading outside Japan on the 
Simex in Singapore at the turn of the month in Japanese security markets during 
the period September 1986 to September 1988, before there was futures trading 
on the NSA or Topix in Japan. The spot effect was consistent with past data, so 
the futures market did not alter the effect. However, the futures market in 
Singapore anticipated the effect on days -8 to -5 with a total average rise of 2.8%. 
Then, when the effect occurred on days -5 to +2 and the spot market gained 
1.7%, the futures market had zero returns. 

Cadsby and Ratner (1991) investigated turn-of-the-month and holiday 
effects in Canada, the United Kingdom, Australia, Switzerland, West Germany, 
Japan, Hong Kong, Italy, and France. Their use of -1 to +4 as the turn of the 
month in all these countries makes their results hard to interpret, as there may be 
cash flow and institutional constraint date alterations as in Japan. Still, they find a 
significant turn of the month in most countries. Ziemba (1993a) also confirms the 
existence of a significant turn-of-the-month effect in the United Kingdom cash and 
futures markets. 

In this paper we investigate the turn-of-the-month and _first-half-of-the- 
month effects in the 10-year period of trading in US index futures from May 1982 
to April 1992. Results are presented for the cash and futures markets for small- 
capitalized stocks proxied by the Value Line Composite Index and for large- 
capitalized stocks proxied by the S&P 500 Index. The results are broadly 
consistent with those of Ariel (1987) in that all the gains in the stock market for 
small and large-capitalized US securities occur in the first half of the month. For 
small and large-capitalized stocks there were significant positive returns in the 
cash and futures markets at the turn and first half of the month. These returns 
were significantly greater than average during the turn of the month. 

Mean returns were lower than average in the second half of the month and 
significantly so for the small-capitalized stocks in the cash market. There is a 
strong monthly seasonality effect, particularly for the small-capitalized stocks. The 
returns were significantly higher than average at the turn and first half of the 
month in January, February, March, and November for large-capitalized stacks. 
Mean returns are greater than zero during the turn of the month in each of the first 
and last three months of the year and above, but not significantly greater than, 
average. The month-by-month returns for the large-capitalized stocks have 
similar patterns to the small-capitalized stocks, but the mean returns are not as 
large in the first half of the month or as negative in the second half of the month. 
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Ariel found abnormally large returns starting only on day -1 in his early cash 
market sample. We find abnormally large returns starting on day -4 in both the 
cash and futures markets in our later, non-overlapping sample. Perhaps this is a 
result of speculative activity on the futures market anticipating a cash market gain 
starting on day -1, which moves the anomaly ahead to day -4. Index arbitrage 
may then be forcing the anomaly to appear on day -4 in the S&P cash market as 
well, although it is not statistically significant. This is consistent with the greater 
index arbitrage activity for the S&P 500 contract. 

The results in this paper add to the literature on seasonal anomalies in 
security markets, of which the following are some of the important references. 
Discussion of the day-of-the-week effect appears in French (1980), Gibbons and 
Hess (1981), Jaffe and Westerfield (1985ab), Kato, Schwartz and Ziemba (1989), 
Lakonishok and Maberly (1990), Solnik (1990), and Ziemba (1993b). Holiday 
effects are analyzed by Ariel (1990), Cadsby and Ratner (1991), Pettengill (1989), 
and Ziemba (1991). Monthly effects are studied by Gultekin and Gultekin (1983), 
Brown, Kleidon, and Marsh (1983), Jacobs and Levy (1988), Hawawini (1991), 
Jaffe and Westerfield (1989), and Cadsby (1992). Harris (1986) studied time-of- 
day effects. January and size effects are analyzed by Rozeff and Kinney (1976), 
Banz (1981), Reinganum (1981), Keim (1983, 1989), Roll (1983), Ritter (1988), 
Ritter and Chopra (1989), and Jaffe, Keim, and Westerfield (1989). Surveys of 
this literature appear in Thaler (1987ab), Ziemba (1993a), and Hawawini and 
Keim (1993). 

Section 2 presents the results in the futures and cash markets for the 
10 years from May 1982 to April 1992, and section 3 provides concluding 
remarks. 


2. The monthly return patterns in the cash and futures markets 


The data used in this study consist of the daily cash and futures closing 
prices for the Value Line Composite and S&P 500 Index contracts for the 10-year 
period May 1982 to April 1992. The futures contracts were chosen as those 
contracts with the nearest expiry date. The rollover into the next contract does not 
occur in the day -4 to day +4 period that is the focus of this study, and hence 
cannot explain any of the anomaly. This period was chosen to not overlap with 
Ariel's (1987) study, which used cash data from 1963-1981, and to include the 
entire period of index futures trading. The Value Line Composite is an equally 
weighted index of about 1,650 stocks and is used as a proxy for small stocks. It is 
currently arithmetically weighted. However, it was geometrically weighted from 
May 1982 until January 1988. Futures contracts expiring in March, June, 
September, and December are traded on the Kansas City Board of Trade. 
Futures options on this index are traded on the Philadelphia stock exchange. The 
S&P 500 Index is a value-weighted index of 500 large-capitalized stocks and 
serves as the large-capitalized index. S&P 500 futures contracts with the same 
maturity as the Value Line are traded on the Chicago Mercantile Exchange, and 
futures options are traded at the Chicago Board of Trade. Daily volume in the 
S&P 500 futures contracts is in the 40,000-100,000 contracts range, with a 
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contract being $500 times the index value. A substantial portion of this volume is 
associated with various types of program trading, including index arbitrage. 

Daily volume in the Value Line Composite futures contract is less than 
1,000 contracts per day. This includes little if any index arbitrage trading. 
Figure 1 shows the small and large-capitalized returns in the cash and futures 
markets for the period May 1982 to April 1992 by trading day of the month. Figure 
1 shows percentage rates of return for both the cash and futures markets. Since 
the investment in a futures contract is $0, this is not a return in the investment 
sense. However, it is a useful scaling of the dollar profit accruing to the daily 
mark-to-market, adjusting for contract size and price. 

There were high returns on trading days -1 to +3 in the cash and -1 to +2 in 
the futures markets. Some, but not all of these days had returns significantly 
above average. This is consistent with the results documented for the cash 
market in the earlier period by Ariel in a paper that was published at the mid point 
of the time interval of our data sample. That is, the cash anomaly was well-known 
for part but perhaps not all of our data period and our results are an out-of-sample 
confirmation of Ariel's results. Unlike Ariel, we also find some anticipation of the 
turn-of-the-month, particularly on day -4 in the futures markets. The effect was 
more pronounced with the S&P 500 cash market. This is consistent with 
speculation on the futures markets anticipating the cash market gains starting on 
day -1 as documented by Ariel. The high degree of index arbitrage on the S&P 
contract carries the effect into that cash market. To the extent that the speculative 
anticipation by the futures markets is a type of intertemporal index "arbitrage," the 
anticipation effect is stronger for the S&P futures contract than for the Value Line 
contract, because of the greater arbitrage activity with that contract, which has 
fewer and more heavily traded stocks in its index. 

The comparison of the turn-of-the-month anomaly for cash and futures 
markets also allows us to study another potential explanation of the anomaly in 
the cash market. If ex-dividend dates do not occur evenly through the month, 
there will be a tendency for the cash (indices) to fall on those dates when there is 
a high incidence of ex-dividend activity. This could contribute to a seasonal 
anomaly in the indices, since they are not adjusted for dividends. However, the 
futures are essentially adjusted for dividends, because rational investors know the 
expiry date of the futures and can anticipate accurately the dividends to be paid to 
the index each day. Hence, they will set futures prices to reflect the distribution of 
this final ex-dividend index value. Thus, if the turn-of-the-month anomaly is an 
artifact of ex-dividend price drops in the cash index, it should vanish in the futures 
market. Since the anomaly still appears in the futures market, the ex-dividend 
explanation does not hold. 

From Ariel's analysis, the cash market is known to exhibit the anomalous 
turn-of-the-month behavior in which high profits are earned at the turn-of-the- 
month. One might hypothesize that speculators would adopt trading strategies at 
the turn-of-the-month that would drive these profits to zero. By going to a period 
subsequent to that of Ariel, we find that this anomalous turn-of-the-month 
behavior still arises in the cash market, although it may have been moved ahead 
in time. Thus, the anomalous behavior does not arise because of an ignorance of 
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the speculative opportunity. This means that cash market frictions are sufficiently 
large to make the speculative behavior unrewarding. The fact that the Value Line 
anomaly is more pronounced than the S&P anomaly might suggest that it is more 
difficult to speculate in the small-capitalized stocks, due to higher bid-ask spreads, 
for example. 


(a) Small Capitalization Stocks (Value Line Composite) 


Vole Une Cash Market Average: 


(b) Large Capitalization Stocks (S&P 500) 


EA wre 
iT E se con 


— SWP 600 Com Market Average 


Trading Day ofthe Monin 


Fig. 1. Average daily returns in the cash and futures market for small and large- 
capitalized stocks by trading day of the month, May 1982—April 1992. 
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Table 1 


Average daily returns (log values) in the cash and futures markets for small and large-capitalized 
stocks by trading day of the month, along with t-values for the hypothesis that the day's return 
differs from the mean return, May 1982-April 1992. In this and succeeding tables, t-values 
significant at the 5% level (two-tailed) are in bold type. 


(a) Cash Market Data 


Trading Average Value 
Day Daily Return (%) Line S&P 500 
Value Line S&P 500 t-value t-value 
-9 -0.0813 0.0992 -1.33 0.43 
-8 0.1101 0.1025 0.63 0.35 
-7 -0.1281 -0.0527 -2.40 -1.28 
-6 0.0470 0.0652 -0.03 0.06 
-5 -0.1163 -0.1239 -1.61 -1.68 
-4 0.0077 0.1246 -0.64 0.83 
-3 0.0333 0.0548 -0.24 -0.08 
-2 0.0655 0.0838 0.24 0.27 
-1 0.3218 0.2361 3.74 2.03 
1 0.1342 0.1309 1.34 0.79 
2 0.2769 0.2637 3.52 2.74 
3 0.1554 0.0852 1.46 0.27 
4 0.0303 -0.0123 -0.25 -0.79 
5 -0.0862 -0.1181 -1.84 -1.95 
6 0.0174 0.0124 -0.51 -0.61 
7 0.0463 0.0716 -0.04 0.13 
8 0.0148 0.0003 -0.51 -0.69 
9 0.0320 0.0647 -0.25 0.05 


Average -9 to 9 0.0489 0.0604 
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(b) Futures Data 


Trading Average Value 
Day Daily Return (%) Line S&P 500 
Value Line S&P 500 t-value t-value 
9 0.1230 0.0973 0.68 0.38 
8 0.0465 0.1959 -0.04 0.76 
7 -0.2346 -0.0932 -2.68 -1.37 
-6 0.1176 0.0317 0.65 -0.28 
-5 -0.2533 -0.0779 -2.13 -1.12 
-4 0.1329 0.1757 0.95 1.31 
-3 0.0553 0.0315 0.03 -0.31 
-2 0.0695 0.0184 0.19 -0.39 
-1 0.2082 0.1285 1.82 0.72 
1 0.3314 0.2122 2.72 1.54 
2 0.2567 0.2541 2.11 2.18 
3 0.0572 0.0467 0.04 -0.10 
4 -0.0217 0.0000 -0.72 -0.53 
5 -0.1087 -0.1276 -1.44 -1.61 
6 0.0637 0.0354 0.12 -0.25 
7 0.0822 0.0410 0.29 -0.16 
8 0.0083 -0.0163 -0.47 -0.72 
9 0.0190 0.0759 -0.32 0.18 


Average -9 to 9 0.05295 0.0572 


Transactions costs and other frictions on the futures markets are lower and 
some researchers (e.g., Kleidon and Whaley [1992]) have suggested that the 
superior efficiency of order execution in futures trading mechanisms can allow de- 
coupling between the futures and cash markets. This suggests the hypothesis 
that the turn-of-the-month anomaly will be less pronounced in the futures market 
than in the cash market. To the extent that the more-pronounced Value Line 
anomaly is a result of greater frictions in the small-capitalized cash market, one 
might further hypothesize that any attenuation of the anomaly in the futures 
market will be more noticeable with the Value Line futures than the S&P futures. 
Neither of these hypotheses seem to hold. An examination of Figure 1 and Table 
1 shows that the futures market anomalies are just as pronounced as the cash 
market anomalies. The Value Line futures anomaly is more pronounced than the 
S&P futures anomaly, with no evidence of attenuation. 

One might also hypothesize that the futures market anomalies would 
anticipate the cash market anomalies because the futures market has lower 
frictions and because of delays in the settlement process. Indeed, the futures 
market anomalies are larger on day -4 and smaller on day -1. However, the 
futures market anomaly is also larger on day +1, suggesting that the futures 
market has not fully anticipated the profits in the cash market. 
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The effects of seasonality by month and period of the month are 
investigated in Table 2. Average daily logarithmic returns were computed during 
the turn-of-the-month (trading days -1 to +4), called TOM, the first half of the 
month (trading days -1 to +9), called FH, and the rest of the month (trading days 
+10 to -2), called ROM. The average returns and t-values for the hypothesis of 
positive returns and returns above average are given by month, and for all months 
with and without January, for the cash and futures data. 

There is a strong monthly seasonality effect shown in the mean returns 
during the turn-of-the-month, the first half of the month, and rest of the month 
shown in Table 2. As expected, January has very high small-stock returns during 
TOM. The results for 1982-1992 are consistent with Roll's (1983) results for 
1963-1978 that the small stocks greatly outperformed the large-capitalized stocks 
on trading days -1 to +4 in January. 

In addition, the t-statistics indicate higher than average returns for the 
Value Line Cash Index during TOM and the FH during February, March, June, 
and November. The returns are also high in these months for the S&P 500, but 
the t's are not significant. July and September and the category “all months 
except January” also have Value Line cash returns that are significantly below the 
average returns during the second half of the month. 

Also, all the months except January, February, August, and December 
have negative or negligible ROMs. The S&P 500 Index is similar but less 
pronounced. The data suggest that the seasonality in small stocks is much 
greater than in large stocks. 

The returns during TOM and ROM are significantly higher than average for 
the small stocks. Moreover, the small stocks have significantly lower than 
average returns during ROM. The cash effect was stronger during these periods 
for the small stock Value Line Index in the cash market because there was some 
anticipation in the futures market. The large-capitalized S&P 500 Index results 
are broadly similar. However, the t-values are lower and not significant at the 5% 
level during FH and ROM. 

Although the futures market results in Table 2 are similar to those in the 
cash market, the futures markets have fewer significant t-values. This is probably 
due to the greater ease with which speculative trading can take place to anticipate 
and mitigate the effect at the turn-of-the-month. The speculative anticipation, 
while not statistically significant, has the effect of reducing the TOM and FH 
returns in the futures markets and therefore, the number of significant t-values. 

Table 3 shows similar results using regression models with dummy 
variables for the TOM, second week (SW), and ROM periods. It is not possible to 
run a regression with all three subperiods because they are linearly dependent. 
Hence, separate regressions using each period as a dummy (0,1) variable were 
run to isolate the different effects. 
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Average daily returns and ¢-values for the hypothesis that returns are greater than zero and the mean return, respectively, in the cash and futures markets for 
small and large capital stocks during the turn of the month, first half of the month, and rest of the month, May 1992~April 1992. 


Value Line Index 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 

- All Months 
All except Jan 


S&P 500 Index 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All Months 
All except Jan 


Table 2 


(a) Cash Market Data 


Average Daily Returns (%) 


TOM 
-1to4 


0.5483 
0.3839 
0.3229 
0.0054 
0.1464 
0.2203 
0.0278 
-0.0212 
0.0495 
0.1272 
0.3421 
0.0521 
0.1837 
0.1506 


0.2730 
0.2116 
0.2220 
0.0308 
0.1393 
0.2074 
-0.0456 
0.0234 
0.0138 
0.2029 
0.2776 
0.1323 
0.1407 
0.1287 


FH 
-1to9 


0.2547 
0.2479 
0.1712 
0.0161 
0.1147 
0.0709 
0.0416 
0.0036 
-0.0804 
0.0422 
0.2321 
0.0170 
0.0943 
0.0797 


0.0716 
0.1471 
0.0987 
0.0432 
0.0872 
0.0579 
0.0348 
0.0540 
-0.0961 
0.1256 
0.1500 
0.1073 
0.0734 
0.0736 


ROM 
10 to -2 


0.1087 
0.1030 
0.0099 
0.0051 
-0.0084 
0.0006 
-0.0927 
0.0864 
-0.0801 
-0.3121 
-0.0711 
0.1410 
-0.0112 
-0.0225 


0.1371 
0.0909 
0.0297 
0.0680 
0.0377 
0.0521 
-0.0332 
0.1129 
-0.0443 
-0.1763 
-0.0718 
0.1531 
0.0290 
0.0188 


All Days 
-1to-1 


0.1769 
0.1784 
0.0842 
0.0104 
0.0494 
0.0336 
-0.0291 
0.0493 
-0.0802 
-0.1518 
0.0768 
0.0819 
0.0388 
0.0261 


0.1065 
0.1202 
0.0615 
9.0560 
0.0609 
0.0548 
-0.0009 
0.0865 
-0.0698 
-0.0397 
0.0364 
0.1313 
0.0501 
0.0449 


t-Values (Ho: return = 0) 


TOM 
-1to4 


FH ROM 
-1to9 10to-2 
2.61 1.42 
3.61 1.56 
2.94 0.18 
0.22 0.09 
2.16 -0.14 
1.18 0.01 
0.68 -1.59 
0.04 1.07 
-1.04 -1.40 
0.55 -1.61 
2.46 -1.07 
0.22 1.70 
4,30 -0.44 
3.59 -0.83 
0.58 1.46 
1.68 1.01 
1.28 0.41 
0.42 0.82 
1.15 0.48 
0.67 0.65 
0.44 0.44 
0.54 1.44 
-0.97 -0.56 
1.24 -0.69 
1.38 -0.78 
1.05 1.66 
2.65 0.88 
2.62 0.54 


All Days 
-1 to -1 


t-Values (Ho: ret diff = 0) 


TOM 
-1to4 


FH 
-1to9 


10 to -2 


ROM All Days 
-1to-1 


Avg Daily VL Index Ret = 0.0388 


4.08 
3.59 
3.51 
-0.36 
1.44 
2.16 
-0.12 
-0.42 
0.11 
0.93 
2.06 
0.10 
4.61 
3.49 


2.21 


0.91 

0.97 
-0.53 
-0.57 
-0.77 
-0.58 
~2.26 

0.59 
-2.07 
-1.81 
-1.65 

1.23 
-1.95 
2.26 


2.26 
2.91 
1.14 
-0.61 
0.26 
-0.12 
-1.60 
0.17 
-2.49 
-4.70 
0.65 
0.76 
0.00 
-0.72 


Avg Daily S&P Index Ret = 0.0501 


1.43 
1.36 
1.56 
-0.15 
0.86 
1.29 
-0.78 
-0.17 
-0.26 
1.08 
1.55 
0.48 
2.31 
1.95 


0.17 
1.11 
0.63 
-0.07 
0.49 
0.09 
-0.19 
0.04 
-1.48 
0.74 


0.93 
0.46 
-0.28 
0.22 
-0.16 
0.03 
-1.11 
0.62 


0.74 
1.12 
0.22 
0.09 
0.20 
0.08 
-0.94 
0.51 
-1.90 
-0.61 
-0.19 
1.18 
0.00 
-0.23 
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Value Line Futures 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All Months 
All except Jan 


S&P 500 Futures 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All Months 
All except Jan 


(b) Futures Market Data 


Average Daily Returns (%) 


TOM 
-1to4 


0.5367 
0.3537 
0.2795 
0.0419 
0.1362 
0.1601 
0.0204 
-0.1085 
-0.0024 
0.1516 
0.2528 
0.1488 
0.1642 
0.1304 


0.2437 
0.2042 
0.1839 
0.0111 
0.1235 
0.1585 
-0.0053 
-0.0243 
-0.0163 
0.4708 
0.2680 
0.2097 
0.1273 
0.1167 


FH 
-1to9 


0.1926 
0.2115 
0.1467 
0.0186 
0.1228 
0.0709 
0.0715 
-0.0306 
-0.0974 
0.0853 
0.2404 
0.0318 
0.0887 
0.0792 


0.0485 
0.1396 
0.0743 
0.0170 
0.0871 
0.0484 
0.0566 
0.0383 
-0.1185 
0.1284 
0.1499 
0.1094 
0.0649 
0.0664 


ROM _ All Days 
10to-2 -1to-1 
0.1327 0.1606 
0.0859 0.1510 
-0.0165 0.0584 
-0.0247 -0.0039 
-0.0043 0.0551 
-0.0067 0.0298 
-0.1144 -0.0267 
0.0844 0.0329 
-0.0996 -0.0985 
-0.3130 -0.1336 
-0.1097 0.0602 
0.1833 0.1115 
-0.0186 0.0321 
-0.0328 0.0202 
0.1186 0.0859 
0.0738 0.1079 
-0.0060 0.0308 
0.0480 0.0331 
0.0254 0.0542 
0.0493 0.0489 
-0.0649 -0.0073 
0.0987 0.0716 
-0.0529 -0.0852 
-0.1972 -0.0505 
-0.1122 0.0150 
0.1494 0.1303 
0.0100 0.0360 
-0.0002 0.0314 


t-Values (Ho: return = 0) 


TOM 
-1to4 


3.43 
2.38 
2.25 
0.26 
1.36 
1.29 
0.13 
-0.52 
-0.02 
0.94 
1.36 
1.02 
3.69 
2.82 


1.38 
1.47 
1.62 
0.07 
1.15 
1.11 
-0.03 
-0.13 
-0.10 
1.02 
1.49 
1.22 
2.83 
2.51 


FH 
-1to9 


ROM 
10 to -2 


1.27 

0.93 
-0.21 
-0.25 
-0.05 
-0.07 
-1.36 

0.75 
-4.06 
-1.12 
-1.08 

1.80 
-0.51 
-0.85 


1.13 
0.77 
-0.07 
0.52 
0.30 
0.52 
-0.79 
0.89 
-0.56 
-0.55 
-1.07 
1.45 
0.24 
0.00 


All Days TOM 
-1 to -1 -1 t04 


FH 
-1to9 


t-Values (Ho: ret diff = 0) 
ROM _ All Days 
10 to -2 


-1 to -1 


Avg Daily VL Futures Ret = 0.0321 


1.82 3.22 
2.15 2.17 
1.05 1.99 
-0.05 0.06 
0.99 1.04 
0.47 1.03 
-0.40 -0.07 
0.39 -0.67 
-1.38 -0.24 
-0.82 0.74 
0.76 1.19 
1.63 0.80 
1.32 2.97 
0.80 2.13 


Avg Daily S&P Futures Ret = 


0.98 1.18 
1.55 1.21 
0.53 1.30 
0.44 -0.16 
0.91 0.81 
0.72 0.86 
-0.12 -0.26 
0.89 -0.32 
-1.15 -0.33 
-0.25 0.80 
0.18 1.29 
1.78 1.01 
1.34 2.03 
1.11 1.74 


4.10 0.96 
1.72 0.58 
1.44 -0.63 
-0.11 -0.58 
1.20 -0.45 
0.44 -0.42 
0.38 -1.74 - 
-0.50 0.47 
-1.19 1.41 
0.46 -1.24 
1.72 1.40 
0.00 1.48 
1.80 1.39 
1.49 -1.68 
0.09 0.79 
1.03 0.39 
0.47 -0.52 
-0.15 0.13 
0.63 -0.12 
0.12 0.14 
0.21 -1.23 - 
0.02 0.57 
-1.34 -0.95 
0.77 -0.65 -0. 
0.90 -1.41 
0.70 1.10 
0.91 -0.62 
0.94 -0.80 


1.46 
1.70 
0.47 


0.0360 
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Regression results for turn-of-the-month effects for cash and futures markets. 


Value Line 


S & P 500 


Cash 


Futures 


Cash 


Futures 


Cash 


Futures 


Cash 


Futures 


Cash 


Futures 


Cash 


Futures 


Cash 


Futures 


Cash 


Futures 


Intercept 


-0.000062331 
(-0.320) 
-0.00008773 
(-0.315) 


-0.000112 
(-0.480) 
-0.0000186 
(-0.555) 


0.000048697 
(0.140) 
0.000131 
(0.262) 


0.001837 
(0.527) 
0.001642 
(3.285) 


Intercept 


0.000219 
(0.879) 
0.000076605 
(0.249) 


0.00029 
(0.966) 
0.000099614 
(0.269) 


Table 3 


TOM 


0.001899 
(4.756) 
0.00173 
(3.025) 


0.00195 
(4.639) 
0.001828 
(3.039) 


SW 


0.000161 
(0.383) 
0.000317 
(0.526) 


ROM 


0.001788 
(3.625) 
0.00070706 
(2.138) 


TOM 


0.001188 
(2.325) 
0.001196 
(1.894) 


0.001117 
(2.078) 
0.001173 
(1.766) 


-0.001788 
(-3.625) 
-0.001511 
(-2.138) 


SW 


-0.000228 
(-0.424) 


-0.000074205 


(-0.112) 


-0.000161 
(-0.383) 
0.00060164 
(-0.526) 


-0.00195 
(-4.639) 
-0.001828 
(-3.039) 


ROM 


0.000061797 
(0.138) 
0.000025409 
(0.046) 


0.001407 
(3.152) 
0.001273 
(3.285) 


0.001345 
(2.131) 
0.001247 
(1.599) 


-0.001345 
(-2.131) 
-0.001247 
(-1.599) 


0.000228 
(0.424) 
0.000074205 
(0.112) 


-0.001117 
(-2.078) 
-0.001173 
(-1.766) 
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Cash Market: 


Value Line Cash Market 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All months 
All except Jan 


S&P 500 Cash Market 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All months 
All except Jan 


Table 4 


Growth of $1 investment for small and large capitalized stocks in various periods, May 1982—April 1992. 


Growth of $1 Investment 


TOM 
—1to4 


1.31539 
1.21160 
1.17520 
1.00272 
1.07595 
1.11645 
1.01399 
0.98944 
1.02506 
1.06565 
1.18652 
1.02641 
3.01108 
2.28912 


1.14627 
1.11162 
1.11741 
1.01551 
1.07212 
1.10926 
0.97746 
1.01177 
1.00693 
1.10680 
1.14888 
1.06837 
2.32626 
2.02942 


FH ROM All Days 
1to9 10to-2 -1to-1 
1.29004 1.13193 1.46024 
1.28128 1.09938 1.40861 
1.18676 1.01164 1.20057 
1.01623 1.00556 1.02187 
1.12155 0.99050 1.11089 
1.07345 1.00064 1.07413 
1.04248 0.90220 0.94053 
1.00359 1.11211 1.11610 
0.92279 0.92078 0.84969 
1.04312 0.68550 0.71505 
1.26119 0.92802 1.17041 
1.01711 1.16779 1.18778 
3.10035 0.86112 2.66979 
2.40330 0.76076 1.82832 
1.07421 1.16913 1.25589 
1.15848 1.08725 1.25956 
1.10378 1.03535 1.14280 
1.04413 1.07551 1.12298 
1.09108 1.04347 1.13851 
1.05961 1.06061 1.12383 
1.03546 0.96383 0.99801 
1.05548 1.14891 1.21266 
0.90834 0.95544 0.86786 
1.13381 0.80792 0.91603 
1.16187 0.92741 1.07753 
4.11331 1.18344 1.31753 
2.41413 1.46965 3.54792 
2.24736 1.25705 2.82503 


FUTURES: 


Value Line Futures 
January 
February 
March 
Aprii 
May 
June 
July 
August 
September 
October 
November 
December 
All months 
All except Jan 


S&P 500 Futures 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All months 
All except Jan 


Growth of $1 Investment 


TOM 
1te4 


1.30778 
1.19347 
1.15000 
1.02119 
1.07050 
1.08334 
1.01023 
0.94718 
0.99882 
1.07876 
1.13476 
1.07725 
2.67908 
2.04856 


1.12958 
1.10752 
1.09629 
1.00554 
1.06369 
1.08250 
0.99736 
0.98791 
0.99191 
1.08913 
1.14337 
1.11057 
2.14628 
1.90007 


FH 


ROM 


All Days 


1109 10to-2 =1to-1 


1.21245 
1.23555 
1.15802 
1.01872 
1.13062 
1.07348 
1.07409 
0.96987 
0.90723 
1.08907 
1.27172 
1.03228 
2.89817 
2.39034 


1.04971 
1.14983 
1.07711 
1.01716 
1.09100 
1.04955 
1.05826 
1.03906 
0.88827 
1.13696 
1.16170 
1.11560 
2.17926 
2.07606 


1.16484 
1.08313 
0.98069 
0.97369 
0.99508 
0.99250 
0.87976 
1.10945 
0.90159 
0.68255 
0.89022 
1.22568 
0.77980 
0.66945 


1.14479 
1.07107 
0.99291 
1.05318 
1.02936 
1.05734 
0.93050 
1.12911 
0.94693 
0.78618 
0.88791 
1.17858 
1.14223 
0.99777 


1.41232 
1.33826 
1.13566 
0.99192 
1.12506 
1.06542 
0.94495 
1.07602 
0.81795 
0.74335 
1.13211 
1.26524 
2.26000 
1.60020 


1.20169 
1.23154 
1.06948 
1.07125 
1.12303 
1.10973 
0.98471 
1.17321 
0.84112 
0.89386 
1.03148 
1.31482 
2.48922 
2.07143 
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The TOM variable is always highly significant for the small-capitalized 
stocks and either significant or nearly significant for the large-capitalized stocks. 
The second week and the rest of the month have insignificant returns when TOM 
and the mean return are accounted for. When TOM is excluded, then both the 
second week of the month and the rest of the month have returns significantly 
below the mean return. 

Table 4 shows the growth of one dollar over the 10 years of our sample in 
the cash and futures market for small and large-capitalized stocks during the turn, 
first half, and rest of the month as well as during all days by month. These values 
show the compounded effects of monthly as well as intramonth seasonality during 
the TOM, FH, and ROM subperiods and the futures anticipation. 

In interpreting the results in Table 4, one must consider the number of 
trading days in each period: TOM had 5, FH had 10, ROM had about 10-12, and 
the whole month had about 20-22. The highest dollar returns in the cash and 
futures market were during the short time at the turn-of-the-month. The January 
small stock effect is seen in the high Value Line futures returns in the second half 
of December (partial anticipation) and the turn-of-the-month for January. There 
were extremely low returns in the second half of September and October in the 
cash and futures markets. 

A normality test was made and the results are shown in Table 5. In the 
cash market the normality of the first-half price changes cannot be rejected, but 
the returns were non-normal during the turn-of-the-month and the rest of the 
month. The high returns during TOM and low returns during ROM in the cash 
market apparently lead to the rejection of normality because of their skewness. 
The smoothing effect of the futures anticipation yields distributions that are not 
skewed enough to reject the normality hypothesis. 


Table 5 
Results of Normality Tests 


P-Values* 
TOM FH ROM 
Value Line 
Futures 0.45 0.08 0.00 
Cash Market 0.00 0.12 0.00 
S&P 500 
Futures 0.79 0.29 0.00 
Cash Market 0.01 0.39 0.00 


The P-Value is the probability of rejecting the null hypothesis that the input 
data values are a random sample from a normal distribution when true. 
The test used calculates the Shapiro-Wilks W statistic, with small values of 
W leading to rejection of the null hypothesis. 
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The distributions of returns in the futures market in the three parts of the 
month are shown in Figure 2. The results show that there are differences in the 
mean return during the turn, first half, and rest of the month that affect the total 
returns during these periods as discussed above. As Ariel (1987) found for the 
1963-1981 cash data, the higher returns during TOM and ROM were caused by a 
slight shifting of the return distribution and not by a few outliers. These graphs are 
for the entire period and aggregate the stronger effects of individual months. The 
cash distributions, though slightly more skewed to the right, are similar and show 
the same effects. 


3. Concluding Remarks 


The results show that there has been a substantial turn-of-the-month effect 
in US stock prices during the period 1982-1992 of index futures trading for small 
and large stocks. To avoid data snooping biases such as those discussed by Lo 
and MackKinlay (1990), we defined the turn to be first half and rest of the month as 
Ariel (1987) did in his study of the cash markets from 1963-1981. The turn is 
composed of trading days -1 to 4, the first-half days -1 to +9, and second-half 
days +10 to -2. The results show that the mean returns in the cash and futures 
market were positive in the first half of the month and negative in the second half 
for the Value Line. 

Moreover, during the turn-of-the-month, the mean return in the cash and 
futures market was above average. Partial anticipation occurred in the futures 
market on the three days prior to the turn-of-the-month, particularly on trading day 
-4. This anticipation was larger for the large-capitalized S&P 500 Index futures 
than for the small-capitalized Value Line futures. Moreover, it carries over to the 
S&P cash market, even though is does not appear in the Value Line cash market. 
This is consistent with the existence of considerable index arbitrage in the S&P 
500 and virtually no such arbitrage in the Value Line. 

Monthly seasonality effects were strong, with high returns occurring in 
January and the rest of the first quarter during the turn of the month. May, June, 
October, and November also had high returns during TOM. The rest of the month 
generally had negative returns, with the lowest returns in September, October and 
November. The media's widely discussed negative returns in October are 
supported by the data, but the negative returns were all in the ROM, while the 
TOM and FH were positive. The shift in mean return was larger for the small than 
for the large-capitalized stocks. 

The cumulative effects of investment during various time periods magnify 
the effects. The results indicate that the total return over this 10-year period was 
mostly received during the turn-of-the-month in the cash and futures markets for 
small and large-capitalized stocks. There were small gains during the rest of the 
month for the large stocks and losses for the small stocks. These gains for the 
large stocks were essentially a result of the anticipation of the turn-of-the-month 
cash gains in the futures market on the previous three days. 
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(a) | Small Capitalized Stocks (Value Line) 
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Fig. 2. Relative frequency return distribution in the futures markets for small and 
large-capitalized stocks, May 1982—April 1992. 
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The return distributions during the three parts of the month show that the 
larger returns at the beginning of the month and lower returns the second half of 
the month were not due to several large outliers but were instead due to a shift in 
the mean return and a skewness of the distribution. 
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Worldwide security market anomalies 
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Systematic violations of security market efficiency occur in equity markets because 
of the timing and reaction to cash flows and other information, institutional 
constraints and policies, and investor behaviour. They lead to significantly different 
risk-adjusted returns to those expected. Taking these anomalies into account 
provides opportunity for superior investment performance. 

Classifying anomalies as fundamental or seasonal differentiates between individual 
securities and market timing with indices. Seasonal anomalies include the January 
small-firm, turn-of-the-month, holiday, and day-of-the-week effects. Seasonality 
calendars combine the various effects to provide daily return forecasts. 

Fundamental anomalies include price to earnings, price to book, market 
capitalization, dividend yield, earnings trends and surprises, and mean reversion 
effects. These variables add explanatory power to that from risk measures and yield 
factor models to separate the best from the worst performing individual securities. 

Anomalies are controversial, difficult to measure and variable in time through 
investor sentiment and futures anticipation. Their study is interesting and 
challenging, and they are useful in various areas of portfolio management. 


1. Introduction 


One of the most basic and useful paradigms of modern financial economics is the 
efficient market hypothesis. In a pure form the hypothesis may be stated as: current 
security prices reflect all publicly available information. This, surely, is false. More 
appropriate for actual markets is that current security prices reflect all publicly 
available information to the point where the marginal profits from acting on the 
information do not exceed the marginal costs. In other words, current prices are 
within a transactions cost band. In a wide variety of equity, commodity and other 
financial markets, this is more or less true. However, rigorous and defensible tests of 
this version of the hypothesis are difficult to carry out and equally hard to interpret. 
First there is a messy problem of dealing with transactions costs. Moreover, as 
stressed by Roll (1977), such a test, if properly undertaken, should be a joint 
hypothesis concerning market efficiency and the validity of an equilibrium pricing 
model, such as the Sharpe—Lintner capital asset pricing model or Ross’s arbitrage 
pricing model. 

The British statistician, Maurice Kendall, postulated the efficient market 
hypothesis in 1953. Price changes were assumed to follow a random walk. Kendall 
(1953) found considerable support for the hypothesis by examining British stock 
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prices as well as US commodity prices. Kendall’s work ushered in the efficient market 
era which is well represented by the important early studies by Roberts (1959, 1967), 
Black et al. (1972), Cootner (1964), Fama (1965) and Mandelbrot (1963). The early 
literature was surveyed by Fama (1970). 

Over the years market efficiency has evolved from studying weak, semi-strong and 
strong form efficiency (in which prices fully reflect price information, all publicly 
available information and all information respectively) to studies of the pre- 
dictability of returns based on fundamental variables, event studies of the 
adjustment of prices to specific public announcements and tests for private 
information. The evidence is that future returns are predictable from past returns, 
dividend yields and term-structure variables. This apparent violation of weak form 
efficiency of the constant expected returns model is confounded by the joint 
hypothesis problem of whether there is rational variation over time in expected 
returns or systematic deviations from fundamental value. Various security market 
regularities or anomalies can then be seriously analysed, as surveyed in Dimson 
(1988), Thaler (1992) and Ziemba (1988, 1994a), and the review articles by Fama 
(1991), Blume & Siegel (1992), Hawawini & Keim (1994) and Ziemba (19946). 

Anomalies that yield higher short-term equity returns seem to occur for six basic 
reasons: (a) increased cash flows just prior to and during the anomalous period, a 
portion of which is invested in equities; (b) institutional constraints and policies such 
as pension funds investments made on the last day of the month; (c) behavioural 
considerations such as investor sentiment leading to excess purchase or sale of 
equities in related but different securities; (d) the timing of favourable and 
unfavourable information flows such as the delays in reporting bad news; (e) market 
maker supply—-demand balances and bid—ask spread preferences; (f) the slowness of 
the market to react to new information such as earnings changes, favourable and 
unfavourable publicly available ranking changes. Whereas arguments may be made 
that increased returns occur because of increased risk, which is difficult or impossible 
to measure accurately, there is very strong evidence that most or all the gains in 
equity markets actually occur during a small subset of the year’s trading days, 
namely the seasonally anomalous periods. Ritter & Chopra (1989) and Cadsby (1992) 
show, for example, that the only periods where higher risk, as measured by the 
capital asset pricing model’s beta, is rewarded with higher equity returns is precisely 
at the anomalous periods such as the trading day before holidays, at the end of the 
week, at the turn of the month, in the first two weeks of January for small stocks, 
etc. Event studies are more straightforward and less controversial as they are able 
to provide more clear-cut evidence of the effect of the new information. Regarding 
strong form tests there is considerable evidence that corporate insiders have private 
information that is not fully reflected in current prices. 

In the study of anomalies several key questions must be considered. 

1. What are the anomalies, and do they really exist as opposed to their being 
simply random data perturbations discovered because many data sets have been 
investigated simultaneously? See Merton (1985), Lo & MacKinlay (1990), Black 
(1986, 1992) and Markowitz (1992) for discussion of this point. 

2. Why do they occur? 

3. How reliable are they ? 

4, How can you tell when the conditions that led to the existence of the anomaly 
have changed so that the anomaly no longer exists and prices are completely random 
around some expected value ? 
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5. Is the anomaly in question anticipated or altered in the futures, options or other 
derivative security markets ? 

6. Can investors profit from such anomalies in a speculative sense ? 

7. Can investors use anomalies in an organized way in portfolio management to 
achieve superior risk-adjusted performance? 

It is a large task to survey and discuss all the various anomalous areas, and our 
space is short. Hence we concentrate on one very important anomaly, namely the 
turn-of-the-month effect, and trace some of the results in worldwide markets. Along 
the way, evidence will be presented concerning questions 1-7. We refer the reader to 
the detailed survey papers mentioned above for specifics of further modelling, 
fundamental anomalies based on concepts such as price-earnings ratios, earnings 
surprise, etc., and other seasonal anomalies. The premier seasonal anomaly is the 
January small firm effect. It has been studied by many authors such as Banz (1981), 
Reinganum (1981), Blume & Stambaugh (1983), Keim (1983), Clark & Ziemba 
(1987), Jaffe, Keim & Westerfield (1989), Fama (1991), Prahl (1993) and Ziemba 
(1994c). Some of the turn-of-the-month discussion here will consider separately the 
unique month of January. 


2. The turn-of-the-month effect in US equity markets 


Investment advisors such as Fosback (1976) and Hirsh (1986) have argued that US 
stocks have substantial rises at the turn of the month. Ariel (1987) has documented 
the effect for small and large capitalization stocks for the 19 years from 1963 to 1981. 
His data consisted of the equal-weighted and value-weighted indices of all New York 
Stock Exchange (NysE) stocks from the Center for Research on Security Prices 
(CRSP) tape. The turn of the month (Tom) is defined to be the last trading day of the 
previous month (—1) and the first four trading days of the new month (+1 to +4). 
Ariel’s research showed there were very high returns during Tom. The rest of the 
market gains during 1963-1981 occurred in the second week of the month (days +5 
to +9). The first half of the month (FH), namely trading days —1 to +9 or Tom plus 
the second week, had all the gains. The second half of the month (Rom, for the rest 
of the month), which is trading days +10 to —2, had negative returns. Hence, 
investment in the first half of the month provided more than all the year’s stock 
market gains. 

Lakonishok & Smidt (1988) investigated various seasonal anomalies using a 90 
year data set on the Dow Jones Industrial Average (p14) from 1897 to 1986. The ps1a 
is a large capitalization price-weighted index of 30 major NYSE stocks. They found 
that the index rose 0.475% during the four-day period of —1 to +3 each month, 
whereas the average gain for a four-day period was 0.061%. The average gain per 
month over these 90 years was 0.349 %. Hence, aside from these four days at the turn 
of the month, the DJIA had negative returns. 

Ogden (1987, 1990) provides empirical support for the hypothesis about flow of 
funds into the stock market from cash flows and monetary action of agencies such as 
the Federal Reserve System. According to Moody’s Manuals, 70% of the interest and 
principal payments on corporate debt (90% on municipal debt) are payable on the 
first or last days of the month. Furthermore, data in Standard & Poor’s Stock Guide 
indicate that about 45% of dividends on common stock (65% on preferred stock) 
are payable on the first or last day of the month. The payable dates for interest, 
principal and dividend payments on corporate and municipal securities have been 
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consistently on these dates throughout the 20th Century. Ogden’s research provides 
empirical support for the hypothesis that stock prices tend to rise at the beginning 
of months that are preceded by months in which aggregate, economy-wide liquid 
profits are large. In particular, year-end bonuses, large Christmas sales (which, 
according to Schwadel (1988), typically amount to 35% to 55% of annual retail 
sales) and other cash flows make December the highest economic activity month and 
seem to provide part of the reason for January’s high returns during the FH and 
throughout the month for small stocks. 

The following explanations for the turn-of-the-month effect have also been 
advanced: inventory adjustments of different traders (Ritter 1988); the timing of 
trades by informed and uninformed traders (Admati & Pfleiderer 1988) and 
specialists’ strategies in response to informed traders (Admati & Pfleiderer 1989; 
Prahl 1993); seasonal tax-induced trading (Lakonishok & Smidt 1986); and window 
dressing induced by periodic evaluation of portfolio managers (Haugen & Lakonishok 
1988; Ritter & Chopra 1989). 

The issue of data mining discussed by the authors mentioned above is an 
important one to consider, especially because, even with strong seasonality effects, 
the average daily magnitudes of these anomalies are typically less than a two-way 
transactions cost, which is about 0.46 % for NYSE stocks according to Berkowitz et al. 
(1988), or the typical bid-ask spread, which at an eighth is 0.31% of the average- 
priced $40 NysE stock. 

Perhaps the best remedy against data snooping is new data and convincing reasons 
for the effects. The reasons for the turn-of-the-month effect are largely cash-flow and 
institutionally based. Another factor seems to be behavioural. For example, bad 
news such as that relating to earnings announcements is delayed and announced late 
in the month, whereas good news is released promptly at the beginning of the month 
(see Penman 1987). 

To investigate this further, Hensel et al. (1993) used as a data set the daily closing 
prices of the Standard & Poor (S& P) 500 Stock Index for the 65-year period from 
February 1928 to June 1993. This data was supplied by Data Resources 
Incorporated. The S&P 500 is a value-weighted index of large capitalization US 
stocks. Since March 1957 this has consisted of the 500 largest stocks weighted by 
market value (price times number of shares outstanding). Before then it consisted of 
the 90 largest stocks. This index is called the S&P composite or S& P 500. Index 
futures contracts on the 8 & P 500 have been trading since 1982 on the Chicago 
Mercantile Exchange and futures options are traded at the Chicago Board of Trade. 
Hensel et al. (1994) have studied the turn-of-the-month and monthly return patterns 
of the S & P 500 and the Value Line Composite small stock index during the period 
of futures trading from 1982 to 1992. In the sequel we will discuss their results, 
particularly in light of question 5 on futures anticipation. The 8 & P 500 is an ideal 
index to study the turn-of-the-month effect because it has a large number of 
securities (500), is value-weighted (hence each security adds its true weight), is often 
used as a market portfolio, and is the most widely used passive portfolio benchmark 
to represent the US equity market in portfolio performance calculations and as an 
index portfolio. One weakness of this data set is the omission of dividends. Hence it 
is possible that there is a slight bias in some of the results that follow because of the 
unevenness of the dividend stream. However, we believe that this effect, if any, is 
minor, and does not change any of our conclusions. 

Figure 1 shows the average return pattern by trading day. The average return on 
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Figure 1. Average daily returns in the S& P 500 cash market by trading day of the month 
(February 1928 to June 1993). From Hensel et al. (1993). 


trading days —9 to +9 over the 65-day sample was 0.0303% per day. Days —9 to 
—1 are during the previous month and +1 to +9 during the current month. For 
example, day — 6 of December is the day in November that is six trading days before 
the start of December. Significantly higher returns occurred on trading days — 2 to 
+3. Hence, the bulk of the monthly returns occurred at the turn of the month. 

Figure 1 suggests that all the monthly gains occurred in the rH. The Rom had 
negligible returns. The first row of table 1a details these mean returns by trading day 
of the month. The last three columns have the average daily returns in the —9 to +9 
period, the rom (—1 to +4), and the rm (—1 to +9). The mean daily returns over 
the 65 years were higher in Tom at 0.1236% than in the period —9 to +9, at 
0.0303 %, with ru having average daily returns of 0.0695 %. The first row of table 1b 
gives the t-values for the hypothesis that the mean daily returns on the various 
trading days —9 to +9 were above the mean of the —9 to +9 trading-day returns, 
which was 0.0303 % per day. Trading days —1, +2 and +3 had significantly higher 
returns than average. Trading days —2 and +1 had high average returns. All of the 
trading days from —9 to —3 had returns below average. Two of these (—8 and —5) 
were significantly negative. 

Tables 1a and 1b present the mean returns and t statistics, respectively, for the 
hypothesis that the daily returns were greater than the —9 to +9 average by trading 
day of the month, by decade from the 1930s to the 1990s and by month. Trading days 
—1, +2and +3 had significantly higher mean returns than average in most decades 
and for the entire 65-year period. Every decade had high returns on — 1 and +2, and 
only in the 1980s were the returns on +3 negative. The latter may be the 
anticipation of the turn-of-the-month effect in the S & P 500 futures market and the 
associated index arbitrage; see figure 2 and table 2. 

The futures market anticipates the Tom on days —4 to +2, and the index arbitrage 
keeps the ask prices close to the futures prices. Once the futures index moves on day 
—4 the expensive futures are sold and cash stocks are purchased for the index 
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Table 1. (a) Average daily returns in the S & P 500 cash market by decade and month for monthly trading days —9 to +9 (February 1928 to June 1993). (b) t-values 

Jor (a) and figure 1 for the hypothesis that the mean daily returns are greater than the overall mean for the same time period (for days —9 to +9) by trading day of 

the month for the S £ P 500 cash market by decade and month (February 1928 to June 1993). From Hensel et al. (1993) 

Values significant at the 5% level (two tailed) are in bold. Months are tested against the —9 to +9 mean for the entire period. Specific periods are tested against 
the mean for the same period. 


(a) 
—9 to 

-9 -8 -7 -6 -5 —4 -3 —2 -1 +1 +2 +3 +4 +5 +6 +7 +8 +9 +9 TOM FH 
1928-93 ~0.02 —0.05 0.02 —0.01 —0.17 0.00 0.00 0.09 0.15 0.07 0.17 0.19 0.03 0.01 —0.06 0.07 0.02 0.03 0.03 0.12 0.07 
1928-39 0.08 —0.21 0.12 —0.01 —044 —0.14 —0.19 0.08 0.10 —0.01 0.17 0.40 0.24 0.13 —0.25 0.18 0.16 0.07 0.03 0.18 0.12 
1940-49 —0.12 —0.03 0.00 0.00 —0.07 -0.07 —0.02 0.25 0.16 0.09 0.21 0.19 —0.14 0.05 0.03 0.01 —0.16 —0.06 0.02 0.10 0.04 
1950-59 0.03 —0.02 0.10 0.05 —0.17 0.11 0.10 0.08 0.16 0.20 0.30 0.18 0.02 0.01 —0.20 —0.02 0.06 0.05 0.06 0.17 0.08 
1960-69 ~—0.08 —0.10 —0.04 -0.05 -0.20 -0.05 —0.03 0.07 0.15 0.04 0.14 0.13 0.10 0.02 0.02 0.11 0.02 0.02 0.01 0.11 0.07 
1970-79 —0.12 —0.08 —0.03 —0.08 —0.03 0.00 0.08 —0.05 0.09 0.02 0.03 0.21 0.03 0.00. 0.01 0.05 —0.01 0.00 0.01 0.07 0.04 
1980-93 0.03 0.10 —0.03 0.00 —0.07 0.11 0.06 0.09 0.23 0.10 0.19 0.04 —0.06 —0.10 0.04 0.08 0.02 0.08 0.05 0.10 0.06 

—9 to 

1928-1993 -9 —8 -7 —6 —5 —4 =g -2 =i +1 +2 +3 +4 +5 +6 +7 +8 +9 +9 TOM FH 
Jan. —0.08 —015 0.10 0.13 —0.06 0.07 0.05 O14 0.32 0.01 0.56 —0.01 0.19 —O017 0.10 0.07 0.08 —0.10 0.07 0.21 0.10 
Feb. 015 —0.13 —0.10 0.08 —0.28 —0.12 0.09 0.01 0.24 0.07 0.15 —0.16 0.07 —0.09 —0.26 0.12 0.00 0.00 —0.01 0.08 0.02 
Mar. ~0.05 0.03 —0.03 -0.18 —0.21 0.00 —0.06 0.09 0.16 0.20 0.22 0.14 0.22 -004 —0.12 0.17 —0.10 —0.12 0.02 0.19 0.08 
Apr. 0.19 —0.10 —0.04 —0.19 —0.22 0.03 —0.14 —0.13 —0.16 0.08 0.15 0.12 0.10 —006 —0.02 0.20 0.24 —0.06 0.00 0.05 0.06 
May —0.17 —O.11 0.04 -0.06 —-O.11 0.08 —0.07 0.22 0.31 —0.03 0.20 0.32 0.04 0.25 —0.21 O11 —0.19 0.04 0.04 0.17 0.08 
June — 0.04 0.00 0.18 0.02 —0.21 0.01 —0.15 0.08 —0.04 -0.02 0.14 0.3L 0.27 —002 —0.18 -—0.07 0.16 0.15 0.03 0.13 0.07 
July -0.07 —0.10 —0.13 0.23 —0.17 0.14 —0.05 0.10 0.14 0.28 0.32 0.39 —0.01 0.02 0.21 0.08 —0.05 0.31 0.09 0.23 0.17 
Aug. 0.03 0.05 0.14 0.07 —0.06 —0.05 —0.01 0.01 0.15 0.11 0.06 0.21 —0.19 0.13 0.03 0.22 —0.08 0.06 0.05 0.06 0.07 
Sept. —0.09 —0.01 0.24 -0.35 —0.25 —0.19 0.06 —0.22 0.25 —0.17 0.17 0.33 —0.09 —0.31 —0.32 0.11 0.02 —0.22 —0.05 0.10 —0.02 
Oct. —0.15 0.19 —0.11 —0.02 —0.49 —0.12 —0.10 0.30 0.00 0.06 0.26 0.18 —0.23 0.47 —0.07 —0.13 —0.26 0.40 0.01 0.04 0.06 
Nov. —0.03 —0.20 —0.14 0.15 0.02 0.00 —0.03 —0.07 0.21 0.25 —0.17 0.34 —0.06 —0.10 0.04 0.08 0.41 —0.05 0.04 0.11 0.09 


Dec. 0.03 —0.13 0.13 —0.01 —0.03 0.10 0.33 0.52 0.27 —0.02 0.10 0.15 0.11 0.12 0.04 —009 —0.04 —0.09 0.08 0.12 0.06 
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Table 1 (cont.) 


(6) 
9 to 

=p =8 i —6 Sg —4 =3 52 =ł +1 +2 +3 +4 +5 +6 +7 +8 +9 +9 TOM FH 
{928-93 —121 —1.99 -0.29 -612 -482 —0.86 —0.78 1.52 3.31 1.07 3.66 4.04 0.04 —0.38 —2.42 105 -0.27 —0.04 0.00 5.15 3.00 
{928-39 0.26 —1.4i 0.69 —0.21 —3.11 —1.02 —1.33 0.35 0.50 —0.29 0.86 2.33 1.08 0.59 —-1.91 0.93 0.88 0.25 0.00 2.10 Err 
1940-49 ~151 -0.74 —0.22 —-@25 —1.18 —0.97 —0.57 3.07 2.22 0.86 2.52 2.52 —1.88 0.37 0.12 —0.09 -—2.01 —0.93 0.00 2.57 0.74 
1950-59 —0.38 —1.15 0.65 —O.11 2.50 0.88 0.72 0.37 1.77 2.18 4.46 1.68 —0.72 —0.92 -401 -121 —0.04 —0.19 9.00 4.21 0.76 
1960—69 —2.07 -2.07 —104 —1.13 -3.88 —1.14 —6.48 0.84 2.21 0.34 2.00 2.28 1.50 0.11 0.14 L7tk 0.16 0.02 0.00 3.34 2.75 
1970-79 —-163 -1.05 —0.50 -1.20 -043 —0.12 0.88 —0.69 1.09 0.12 0.35 2.38 0.35 —O.11 0.10 0.52 —0.2} —0.12 0.00 1.67 1.28 
1980-93 — 0,24 048 —110 -069 —1.42 6.93 0.16 0.59 2.53 0.67 2.06 -0.10 -—143 -2.01 —0.20 0.4f —0.47 0.38 0.00 1.40 0.47 

—9 to 

1928-1993 —9 —8 =] —6 -5 —4 —3 -2 si +1 +2 +3 +4 +5 +6 +7 +8 +9 +9 TOM FH 
Jan —0.59 —1.31 0.91 0.96 —0.59 0.34 0.08 1.06 3.66 —0.13 3.33 —0.39 0.86 —135 0.55 0.42 0.59 —1.09 1,28 3.03 1.78 
Feb. 0.70 —1.66 —1.00 0.61 —2.13 —1.18 0.58 —0.12 2.31 0.40 1.26 —1.55 046 —-1.19 —2.44 0.59 —0.20 —0.33 ~1.26 1.02 —0.36 
Mar. —0.69 0.03 —0.62 —1.60 —2.08 —024 —0.63 0.45 1.49 1.85 1.58 1.01 0.73 —0.61 —1.10 1.30 -1.33 -1.22 —0.34 2.04 1.16 
Apr. 0.81 -0.92 —0.63 -2.48 —2.09 0.05 —1.38 —1.23 —1.26 0.29 1.02 0.91 0.45 —0.74 —0.19 1.41 1.35 —0.70 —0.77 0.32 0.54 
May —148 —0.89 0.22 —0.82 —1.06 0.42 —0.44 1.57 1.80 —0.55 1.49 2.70 0.02 140 —2.22 0.74 2.11 —0.07 0.21 2.34 1.27 
June —0.68 —0.16 £28 —0.i4 -150 -0.16 —1.68 0.32 —0.43 —0.31 0.78 1.53 144 —0.60 —163 —0.60 0.93 0.71 0.04 1.89 0.97 
duly —0.56 -0.74 —1.62 1.28 —1.22 0.75 —0.61 0.60 Lit 2.11 2.55 2.83 -0.33 —0.06 1.85 0.43 —0.80 2.02 1.71 3.58 3.96 
Aug. —0.02 0.12 0.87 0.36 —0.65 —0.70 -—040 —0.26 0.95 0.69 0.21 1.04 —-1.89 0.68 0.00 1.28 —0.91 0.26 0.43 0.61 0.74 
Sept —0.48 —0.28 1.15 —2.19 —1.47 —1.58 0.24 —1.79 2.04 —1.24 0.98 1.53 —1.31 —2.00 —3.18 0.67 —0.08 —1.39 —2.40 1.12 —0.93 
Oct. —1.01 0.72 —0.80 —035 —2.83 —064 —0.58 1.17 —0.21 0.15 1.92 0.83 —1.21 185 —0.62 —0.92 —1.68 2.18 —0.62 0.18 0.62 
Nav —0.36 —168 —1.27 0.89 —0.10 —0.18 —0.34 —0.84 1.17 1.48 —0.92 2.07 —046 —0.71 0.10 0.24 1.78 —0.61 0.21 1.17 1.08 
Dee. 0.14 —1.35 1.02 —042 —0.5! 0.85 2.44 4.50 164 —0.39 0.47 0.94 0.78 0.68 0.17 -0.88 —0.56 —1.45 2.08 1.51 0.66 
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Figure 2. Average daily returns in percent in the cash and futures markets for the 8 & P 500 by 
trading day of the month (May 1982 to April 1993) (from Hensel et al. 1994). 


arbitrage to lock in a nearly riskless profit and increase the level of the stock market. 
The higher mean returns during Tom and FH and the lower mean returns during ROM 
were partly an effect of a higher percentage of positive return days as well as a 
shifting of the conditional means on positive and negative return days. For example, 
in every decade from 1928 to 1993 the probability of a gain on any of the trading days 
—2 to +3 was always above 50%, and each of these days averaged above 55%. In 
contrast, trading days in Rom had positive returns on significantly less days which 
averaged less than 50%. The results are similar for the 65 year sample across months 
of the year. A table with detailed results appears in Hensel et al. (1993). Table 3, 
which separates the returns in TOM, FH and ROM by month, shows the results clearly. 
Every month had positive returns in ToM and nearly all had negative returns in ROM. 
The t-statistics for the hypothesis that the mean returns are greater than average 
appear in table 4. It is highly significant that the returns in Tom and the FH were 
above average and those in ROM below average. 

There is in excess of $500 billion in equity portfolios indexed to various bench- 
marks of which by far the largest sum is keyed to the S & P 500. The reason for this 
is low transactions costs and management fees plus the fact that most managers fail 
to beat these indices; see Ziemba & Schwartz (1991) for some data across the world. 
Hence a useful strategy may be to invest in the S & P 500 during rom or during the 
FH and be in cash the rest of the time. Table 5 shows the results of these strategies. 
Both of these strategies mean variance dominated the passive S & P 500 strategy 
during these 65 years. In particular the risk levels were much lower than those of the 
S& P 500 and small capitalized stocks. 
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Table 2. Average daily returns (log values) in the cash and futures markets for the S & P 500 by trading 
day of the month, along with t-values for the hypothesis that the day’s return differs from the mean return 
(May 1982 to April 1992). Significant values in bold. From Hensel et al. (1994) 


average 5& P 500 Average S&P 500 
trading S & P 500 cash cash S & P 500 futures futures 
day daily return (%) t-value daily return (%) t-value 
=9 0.0992 0.43 0.0973 0.38 
-8 0.1025 0.35 0.1959 0.76 
-7 — 0.0527 — 1.28 — 0.0932 — 1.37 
—6 0.0652 0.06 0.0317 —0.28 
—5 — 0.1239 — 1.68 —0.0779 — 1.12 
—4 0.1246 0.83 0.1757 1.31 
—3 0.0548 — 0.08 0.0315 —0.31 
—2 0.0838 0.27 0.0184 —0.39 
-i1 0.2361 2.03 0.1285 0.72 
1 0.1309 0.79 0.2122 1.54 
2 0.2637 2.74 0.2541 2.18 
3 0.0852 0.27 0.0467 —0.10 
4 — 0.0123 —0.79 0.0000 — 0.58 
5 —0.1181 — 1.95 — 0.1276 —1.61 
6 0.0124 —0.61 0.0354 —0.25 
7 0.0716 0.13 0.0410 —0.16 
8 0.0003 — 0.69 —0.0163 —0.72 
9 0.0647 0.05 0.0759 0.18 
average 
—9 to 9 0.0604 = 0.0572 = 


Table 3. Average daily returns in the S & P 500 cash market by month, during Tom, Fu, rom and whole 
month (February 1928 to June 1993). From Hensel et al. (1993) 


TOM FH ROM all days 
S & P 500 index —1 to +4 —1 to +9 +10 to —2 —1 to -1 
Jan. 0.2061 0.1025 0.0359 0.0651 
Feb. 0.0807 0.0170 —0.0214 — 0.0024 
Mar. 0.1876 0.0768 —0.0212 0.0208 
Apr. 0.0503 0.0566 — 0.0169 0.0161 
May 0.1653 0.0819 — 0.0836 —0.0107 
June 0.1287 0.0669 0.0033 0.0315 
July 0.2258 0.1697 — 0.0050 0.0738 
Aug. 0.0645 0.0672 0.0129 0.0364 
Sept. 0.0976 — 0.0175 — 0.0978 — 0.0605 
Oct. 0.0445 0.0632 — 0.0787 — 0.1780 
Nov. 0.1108 0.1038 — 0.0821 0.0071 
Dec. 0.1217 0.0564 0.0599 0.0584 
all months 0.1236 0.0703 — 0.0235 0.0186 
all except Jan. 0.1162 0.0674 — 0.0292 0.0143 


The results in table 5 are suggestive. But the reader should note the following: (a) 
the S&P 500 results include dividends; (b) the small capitalized index includes 
dividends and is the value-weighted average of the bottom 20% of nyse stocks; (e) 
the intermediate-term government bonds are total return for five to 10 year 
maturities; (d) the TOM and FH plus cash returns do not include dividends, hence the 
mean returns are biased lower by an amount that is probably greater than one might 
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Table 4. t-values for the hypothesis that the mean returns are greater than average in the S & P 500 cash 
market by month, during TOM, FH, ROM and whole month (1928-1993). From Hensel et al. (1993) 


TOM FH ROM all days 
S & P 500 index —1to +4 —1to +9 +10 to —2 —1 to -1 
Jan. 3.11 2.08 0.56 1.87 
Feb. 1.31 — 0.04 —0.96 — 0.77 
Mar. 2.61 1.41 —1.22 0.09 
Apr. 0.57 0.93 — 0.95 — 0.09 
May 2.84 1.65 — 2.56 —1.04 
June 1.55 1.02 — 0.38 0.42 
July 3.96 4.16 — 0.59 2.00 
Aug. 0.80 1.18 —0.15 0.65 
Sept. 1.22 —0.77 —2.33 —2.29 
Oct. 0.38 0.84 — 1.62 — 0.88 
Nov. 1.20 1.53 —2.41 —0.33 
Dee. 1.79 0.95 1.18 1.52 
all months 5.94 4.13 —3.71 0.00 
all except Jan. 5.28 3.54 —3.76 — 0.47 


Table 5. Comparison of the ToM and FH plus cash strategies with S & P 500 small capitalized stocks and 
intermediate term government bonds (1928-1993). From Hensel et al. (1993) 


yearly average yearly std. dev. of 
returns (%) returns (%) 
S & P 500 9.5 20.1 
Sm. Cap stocks 11.5 30.1 
It Gov. bonds 5.1 4.4 
TOM (— Í to +4)+0.8 cash 10.1 8.8 
FH (— 1 to +9)+0.6 cash 11.1 12.6 


expect to spend on transactions costs and by market impact using futures trading to 
implement these strategjes. 


3. The turn-of-the-month effect in worldwide equity markets 


The turn-of-the-month effect seems to occur in most large equity markets around 
the world. Ziemba (1989, 1991 ; see also the detailed summary in Ziemba & Schwartz 
1991) studied the effect in Japan. By using data on the Nikkei 225 Index for the years 
1949-1988, he found that the turn of the month is similar in Japan except that the 
dates change, with the turn being —5 to +2 and with +3 to +7 being the rest of 
the first half of the month. All the days —5 to +2 had significantly positive returns, 
as shown in table 6. The ru trading days —5 to +7 had mean returns of 0.01142% 
per day against 0.00093 for the Rom days +8 to —4. Hence, essentially all the gains 
were in the FH. 

The reasons for the effect in Japan during this period were as follows and added 
credence to the existence of the effect. 

(i) Most salaries were paid between the 20th and the 25th of the month, with the 
25th being especially popular. 

(ii) There was portfolio window dressing on day —1. 

(iii) Security firms could invest for their own accounts based on their capi- 
talization. Since their capitalization usually rose each month and was computed at 
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Table 6. Mean returns (%) on the Nsa by trading day of the month 1949-1988. From Ziemba (1991) 


trading sample mean trading sample mean 

day size return (Yo) t-statistic day size return (%) t-statistic 

—5 471 0.0899 2.75 10 471 0.062 1.58 

—4 471 0.1041 2.65 11 471 0.0395 1.05 

-3 471 0.1733 3.82 12 471 — 0.0196 —0.50 

—2 471 0.1334 3.18 13 471 0.0115 0.24 

—1 471 0.2255 5.36 14 471 — 0.0042 —0.10 
1 471 0.0980 2.43 15 471 — 0.0306 — 0.54 
2 471 0.1006 2.47 16 471 0.0716 1.35 
3 471 0.0307 0.72 17 471 — 0.0498 —1.20 
4 471 0.0592 1.15 18 429 — 0.0207 —0.49 
5 471 0.0358 0.81 19 350 0.0162 0.36 
6 471 — 0.0005 —0.01 20 229 — 0.0286 —0.48 
7 471 0.0357 0.09 21 118 —0.0476 —0.46 
8 471 — 0.0585 — 1.34 22 39 0.0562 0.45 
9 471 0.1065 2.46 


the end of the month, there was buying on day —3 to account for this. Buying was 
done as soon as possible. 

(iv) Large brokerage firms had a sales push that lasted 7--10 days starting on day 
—3. 

(v) Employment stock-holding plans and mutual funds received money to invest 
in this period starting around day —3. 

(vi) Individual investors buy mutual funds with their pay, which they receive on 
calendar days 15th—25th of the month; the funds were then invested in stocks with 
a lag, so most of the buying occurred on days —5 to +2. 

(vii) For low liquidity stocks, buying occurred over several days by dealing in 
accounts to minimize price-pressure effects. 

Ziemba (1989) investigated the futures market trading outside Japan on the 
SIMEX in Singapore during the turn of the month and examined other anomalous 
effects in Japanese security markets during the period of September 1986 to 
September 1988 before there was futures trading on the NSA or TOPIX in Japan. 
The results were that the spot effect was consistent with past data, so the futures 
market did not alter the effect. 

However, the futures market in Singapore totally anticipated the effect on days 
—8 to —5 with a total average rise of 2.8%; see figure 3. Then, when the effect 
occurred on days —5 to +2 and the spot market gained 1.7%, the futures market 
was flat. 

The turn-of-the-month effect existed and was strong in Japan from 1949 to 1988. 
However, the extreme decline in the Japanese stock market of over 60% from 
January 1990 to August 1992 so shook investor confidence that there was no 
confidence in equity investments. Hence, one would not expect that the usual cash 
flows would enter the market in the usual way to generate the Tom and FH high 
returns. Indeed this was the case, and returns during these periods were not higher 
than average during 1990-1993. See Bell & Ziemba (1994) for an analysis of Japanese 
anomalies during 1990-1993 and Stone & Ziemba (1993) for an analysis of this period 
in Japanese economic history. 

Cadsby & Ratner (1991) investigated turn-of-the-month and holiday effects in the 
US, Canada, the UK, Australia, Switzerland, West Germany, Japan, Hong Kong, 
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Figure 3. Daily rates of return in percent in the first and second halves of the month in Japan. Cash 
data, futures market effects September 1986 to September 1988 (from Ziemba 1989). 
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Figure 4. International Tom effects. Daily rates of return in percent for Canada (1/75-12/87), Hong 
Kong (1/80-1/89), U.K. (8/83-6/88), Australia, Switzerland, West Germany and France (all 
1/80-8/89) (from Cadsby & Ratner 1991). 


Italy and France. Their use of —1 to +4 as the turn of the month in all these 
countries makes their results hard to interpret since there may be cash flow and 
institutional constraint date alterations as in Japan. Still they found a significant 
turn-of-the-month effect in most countries, as shown in figure 4. A good example of 
a significant turn-of-the-month effect with different dates is that in Italy. The month 
there essentially starts on the 15th (approximately) and all trades from then up to 
the following 14th are settled on the last trading date of that calendar month. This 
system leads to high returns from the 15th to the end of that calendar month and low 
or negative returns from the + 1 day to the 14th; see Barone (1990). Not suprisingly, 
Italy has not had high returns on days —1 to +4 and does not appear in figure 4. 


4. Concluding remarks 


The turn-of-the-month effect seems to be a worldwide phenomenon. The paper by 
Martikainen et al. (1994), although it only considers the brief period from January 
1988 to January 1990, shows this with various indices for a variety of regions of the 
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world. They also found that returns on individual days during Tom, especially — 1, 
were high. The latter averaged 0.41 % per day over 24 countries with af value of 3.89 
and was the only trading day with a mean return different from zero. 

The Tom, FH and Rom results are useful for traders and portfolio managers. 
However, points 1-7 mentioned earlier in this paper must always be considered. One 
of us (W. T. Ziemba) has used such ideas in several investment accounts with some 
success. Although there was co-mingling of various anomalous ideas such as those 
discussed in Ziemba (1994b), the ideas discussed here were useful inputs to a 
diversified strategy that returned 27.0% per year net of all costs for the 77 months 
from October 1987 to February 1994, with a yearly standard deviation of 8.0% 
during 1991-1994. 


Research was partly supported by the Social Sciences and Humanities Research Council of Canada 
and Frank Russell Company. 
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World wide security market regularities * 
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Abstract: This paper is a brief survey of systematic violations of security market efficiencies in the US, 
Japan and other world wide equity markets. These security market regularities or anomalies seem to 
occur because of cash flow, institutional constraints and policies, investor behavior and sentiment, the 
slowness of markets to react to new information, the timing of favorable or unfavorable information 
flows and market maker supply-demand balances and optimal bid—ask spreads. While such anomalies 
are controversial and difficult to measure precisely as well as being variable over time, the persistence of 
many of these market regularities is of considerable interest to researchers and financial traders. 
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Introduction 


One of the hallmarks of modern financial eco- 
nomics is the efficient market hypothesis. Is the 
current price the correct price? Is the current 
price useful in predicting future prices? Such 
questions abound. Research began in earnest in 
the early 1960s and a voluminous literature has 
evolved. Fama (1970, 1991) has surveyed the liter- 
ature and also pointed out the difficulty of defin- 
ing and testing for market efficiency. Such a test, 
as stressed by Roll (1977) if properly undertaken, 
is a joint hypothesis concerning efficiency and the 
validity of an equilibrium pricing model such as 
the capital asset or arbitrage pricing models. 

Over the years market efficiency has evolved 
from studying weak, semi-strong and strong form 
efficiency in which prices fully reflect price infor- 
mation, all publicly available information and all 
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information, respectively, to studies of the pre- 
dictability of returns based on fundamental vari- 
ables, event studies of the adjustment of prices to 
specific public announcements and tests for pri- 
vate information. The evidence is that future 
returns are predictable from past returns, divi- 
dend yields and term structure variables. This 
apparent violation of weak form efficiency of the 
constant expected returns model is confounded 
by the joint hypothesis problem of whether there 
is rational variation over time in expected returns 
or systematic deviations from fundamental value. 
Various security market regularities or anomalies 
can then be seriously analyzed as surveyed in the 
books by Dimson (1988), Haugen (1994), Thaler 
(1992) and Ziemba (1988, 1994, 1994a). 
Anomalies that yield higher short term returns 
seem to occur for six basic reasons: a) increased 
cash flows just prior to and during the anomalous 
period a portion of which is invested in equities; 
b) institutional constraints and policies such as 
pension funds investments made on the last day 
of the month; c) behavioral considerations such 
as investor sentiment leading to excess purchase 


0377-2217 /94/$07.00 © 1994 — Elsevier Science B.V. All rights reserved 


SSDI 0377-2217(93)E0179-2 


401 


402 Calendar Anomalies and Arbitrage 


W.T. Ziemba / World wide security market regularities 199 


or sale of equities in related but different securi- 
ties; d) the timing of favorable and unfavorable 
information flows such as the delays in reporting 
bad news; 5) the slowness of the market to react 
to new information such as earnings changes, and 
favorable or unfavorable publicly available rank- 
ing changes; and 6) market maker supply—de- 
mand balances and optimal bid—ask spreads. 
While arguments may be made that increased 
returns may be occurring because of increased 
risk, which is difficult or impossible to measure 
accurately, there is very strong evidence that most 
or all the gains in equity markets actually occur 
during a small subset of the year’s trading days, 
namely, the seasonally anomalous periods. Ritter 
and Chopra (1989) and Cadsby (1992) show, for 
example, that the only periods where higher risk 
as measured by the capital asset pricing model’s 
beta is rewarded with higher equity returns is 
precisely at the anomalous periods such as the 
trading day before holidays, at the end of the 
week, at the turn of the month, in the first two 
weeks of January for small stocks, etc. Event 
studies are more straight forward and less contro- 
versial since they are able to provide more clear 
cut evidence of the effect of the new information. 
Regarding strong form tests there is considerable 
evidence that corporate insiders have private in- 
formation that is not fully reflected in current 
prices. 

This is a brief survey of some of the evidence 
concerning departures from security market effi- 
ciency in the US, Japanese and other world wide 
equity markets. Capitalization varies year by year 
but one may conveniently think of the world’s 
equity markets as approximately: US two fifths, 
Japan a quarter, Europe a quarter and the rest of 
the world about one tenth. The United Kingdom 
is the world’s third largest market with about a 
third of Europe’s share. A survey of many of 
these topics appears in Hawawini and Keim 
(1994). Blume and Siegel (1992) survey various 
theories of security pricing and their relationship 
with the structure of markets. I do not specifically 
discuss why particular countries have had higher 
returns over time than others. Roll (1992) shows 
that these differences are largely explained by the 
diversification aspects of index construction, in- 
dustrial structure and exchange rate behavior. 
Ferson and Harvey (1992) show that lagged beta 
factors are useful in predicting returns in major 


security markets. Froot and Thaler (1990) and 
Eaker and Grant (1990) discuss currency hedging 
of international! portfolios. They show the advan- 
tage of selectively hedging only those currencies 
trading at a forward premium while not hedging 
currencies trading at a forward discount. Jorion 
(1989) shows the advantage of foreign investing. 
In particular, the optimal amount of foreign in- 
vestment is much greater than current alloca- 
tions. Departures from market efficiency are clas- 
sified here as seasonal regularities and cross sec- 
tional fundamental factors that explain equity 
returns. Of great interest is the robustness of the 
specific effect over time across various countries 
and a proper explanation of the reasons for the 
departures (if there is one) from efficiency how- 
ever, it may be defined. Merton (1985), Lo and 
MacKinlay (1990), Black (1986, 1992) and 
Markowitz (1992) discuss data mining and the 
dangers of searching for supposedly predictable 
trends from a large amount of data with many 
potential hypotheses without ex ante explana- 
tions. These dangers are real and are to be taken 
seriously in the application of results such as 
those that follow. 


Efficient market theory and the capital asset pric- 
ing model 


Theoretically, the value of a financial asset 
must equal the present value of its cash flows. 
Over a finite horizon, the present value of the 
terminal value should be added. Gordon and 
Shapiro (1956) and Gordon (1962) formalized this 
into the dividend discount model for the price p 
of the security: 


d 
Lg. 


p= 


assuming that the dividend rate d, the discount 
rate i and the growth rate of dividends g are 
constants. 

Such models and their generalizations to allow 
nonstationarity are useful but one must have good 
estimates of the parameters into the future and 
errors in i — g are magnified in the estimate of p. 
French and Poterba (1991) and Ziemba and 
Schwartz (1991, chapter 5) utilize such hair trig- 
ger models to value the Japanese stock market. 
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The British statistician Maurice Kendall postu- 
lated the efficient market hypothesis in 1953. 
Price changes were assumed to follow a random 
walk. Kendall found considerable support for the 
hypothesis by examining British stock prices as 
well as US commodity prices. Kendall’s work 
ushered in the efficient market era which is well 
represented by the important early studies by 
Roberts (1959, 1967), Black, Jensen and 
Scholes (1972), Cootner (1964), Fama (1965) and 
Mandelbrot (1963). This literature is surveyed by 
Fama (1970). 

An excellent recent survey of the past twenty 
years of voluminous activity studying efficient 
market theory and empirical tests is Fama (1991); 
see also the discussion in Blume and Sicgel (1992). 
The hypothesis’ theoretical underpinnings date at 
least to Bachelier in 1900 and the term random 
walk to Pearson in 1905. The capital asset pricing 
model developed by Sharpe (1964), Lintner (1965) 
and Mossin (1966) provided a method to price 
individual stocks in an efficient market. The ex- 


Table 1 


pected return on a stock was then simply the risk 
free return plus the stocks beta (covariance with 
the market relative to market variance) times the 
excess mean return of the market over the risk 
free rate. T-bills substitute well for the risk free 
asset. The CAPM was also the basis and motiva- 
tion for index funds since the market was essen- 
tially the index. It was John McQuown and Bill 
Sharpe’s early work at Wells Fargo in San Fran- 
cisco, plus the CAPM theory, that pioneered the 
current large scale worldwide activity in this area. 

Roll (1977) showed how difficult it was to 
properly test for market efficiency. A test of 
market efficiency and an equilibrium model of 
asset return must be made jointly. This also raised 
the question of the definition of the market and 
how to measure it. Betas, which give the sensitiv- 
ity of a stock to market movements, are only 
relative to the given market index used. 

The CAPM is a single variable model. Its 
introduction into modern portfolio theory had as 
much impact as Markowitz’s theory of portfolio 


Fundamental variable effects on US equity prices, univariate and multivariate 


Return factor Univariate effect 


Multivariate effect 


Multivariate effect Multivariate effect 


Monthly t-stat Monthly t-stat January t-stat Non-January t-stat 

mean mean mean mean 
Low P/E 0.59 3.4 0.46 47 0.09 0.5 0.49 4.7 
Small Size 0.15 2.3 0.12 2.7 0.14 1.3 0.12 25 
Yield —0.01 —0.1 0.03 0.5 0.67 3.4 —0.03 —0.4 
Zero Yield 0.00 0.0 0.15 1.3 1.00 1.9 0.08 0.6 
Neglect 0.14 1.9 0.10 1.7 0.36 1.8 0.08 1.3 
Low Price — 0.01 ~0.1 0.01 0.2 0.38 2.0 — 0.02 —0.4 
Book /Price 0.17 1.4 0.09 1.2 0.51 2.4 0.05 0.7 
Sales / Price 0.17 3.1 0.17 3:7 0.05 0.2 0.18 4.1 
Cash /Price 0.36 2.7 0.04 0.6 — 0.15 -2.0 0.05 0.8 
Volatility (S.D.) 0.16 0.6 0.07 0.6 0,62 2.1 0.02 0.2 
Beta —0.01 0.0 0.04 0.3 —0.05 -0.1 0,05 0.4 
Coskewness 0.09 0.6 0.04 0.7 0.10 0.5 0.04 0.6 
Controversy —0.33 =2i1 — 0.05 -0.8 —0.01 -(.1 — 0.06 —0.8 
Trend in Ear. Ests (— 1) ).48 48 0.51 8.1 0.60 3.8 0.50 A 
Trend in Ear. Ests (— 2) 0.40 4.4 0.28 4.9 0.25 1.6 0.29 4.7 
Trend in Ear. Ests (— 3) 0.29 3.0 0.19 3.8 0.13 0.6 0.19 3.8 
Earn. Surprise (— 1) 0.44 2.1 0.48 3.7 1.36 1.6 0.42 3.4 
Earn. Surprise (— 2) 0.47 1.8 0.18 0.8 0.14 2.0 0.18 0.7 
Earn. Surprise (— 3) — 0.03 siki =021 =11 -—0.01 0.0 —0.22 Ski 
Earn. Torpedo 0.00 0.0 — 0.10 at y d 0.08 0.3 — 0.12 -19 
Relative Strength 0.30 1.4 0.34 3.5 - 0.13 —0.2 0.39 1.0 
Res. Reversal (— 1) — 0.54 —4.9 — 1.08 —17.8 — 1.38 —6.0 1.06 —16.8 
Res. Reversal (— 2) —0.13 —14 — 0.37 -8.1 — 0.56 -25 — 0.35 7.7 
Short Term Tax = 0.08 —0.4 — 0.04 —0.4 0.38 18 — 0.08 —0.7 
Long Term Tax ~ 0.29 —1.6 0.00 -0.1 0.78 3.2 — 0.07 — 1.2 


Source: Jacobs and Levy (1988a). — 1 means one month lagged, etc. The effect is the mean change in expected return from a one 


standard deviation tilt in the direction of the given factor. 
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selection (1952, 1959). Markowitz’s contribution 
was to provide an operational way to measure 
risk and to compute efficient portfolios that maxi- 
mized return for fixed levels of risk. Both Sharpe 
and Markowitz were awarded the Nobel prize for 
their contributions to the field of financial eco- 
nomics in 1990. Various multifactor equilibrium 
models exist with the theoretical support greatest 
for Ross’s (1976) arbitrage pricing theory (APT). 
A survey of research work on the APT appears in 
Connor and Korajzyk (1994), We now turn to 
empirical work on multifactor models. 


Multiple factor models 


The various fundamental and seasonal effects 
are interrelated and it is useful to attempt to 


isolate the pure effects of particular variables on 
equity returns. Table 1 from Jacobs and Levy 
(1988a) shows this for various monthly adjusted 
fundamental factors for the US market using 
1970s and 1980s data. The univariate and multi- 
variate effects for twenty-five normalized vari- 
ables throughout the year and in and outside of 
January are isolated along with coefficient ¢-val- 
ues. For a variety of reasons, January is the 
highest return month for small capitalized stocks 
and provides a strong signal for all stocks as 
discussed below. Very significant factors are low 
price to earnings, smal! capitalization, high sales 
to price, high cash flow to price, trends in earn- 
ings (current and lagged one or two months) and 
residual reversal (a mean reversal concept). In 
January, the two strongest effects were return 


Table 2 
Univariate and multivariate fundamental variable effects on Japanese Tokyo stock exchange section 1 stocks, June 1979-August 
1989 
Factor Univariate effect Multivariate effect 

Rank Monthly t- Rank Monthly t- 

mean statistic mean statistic 

Earnings growth relative to price 13 0.00820 8.74 1 0.00788 7.85 
Last month’s excess return 11 — 0.00350 — 3.04 2 — 0.00693 — 3.65 
Relative return strength 26 — 0.00649 -RBS 3 — 0.00645 —3.42 
Current minimum past book value /price 19 — 0.00165 — 1.39 4 — 0.00397 — 2.87 
Current price relative to past highs 25 — 0.00279 — 1.47 5 0.00363 2.86 
Low P/E 22 — 0.00038 — 0.30 6 0.00385 2.69 
Small cap 29 — 0.00396 — 2.24 7 — 0.00429 -2.21 
Shares 27 — 0.00017 —0.07 8 0.00382 1.87 
Volatility (S.D.) 28 — 0.00434 = 3.02 9 — 0.00234 ~ 1.72 
Book/ price 18 0.00398 3.31 10 0.00124 1.68 
Acceleration of dividends /price 3 0.00065 1.31 11 0.00105 1.49 
Two year acceleration of dividends /price 10 — 0.00122 —0.18 12 — 0.00136 — 1.45 
Acceleration of dividend payment ratio 8 0.00110 0.94 13 0.00281 1.35 
Acceleration of relative strength 6 — 0.00233 — 1.92 14 0.00250 1.22 
Sales acceleration /price 21 — 0.00117 — 1.29 15 — 0.01645 = 1.20 
Beta 4 — 0.00104 — 0.45 16 0.00223 Lit 
Two year acceleration of dividend payout ratios 1 0.00013 0.12 17 — 0.00147 — 1,00 
Sales growth /price 21 — 0.00117 — 1.29 18 0.00109 0.93 
Percent of shares held by individuals 14 0.00240 2.50 19 0.00046 0.84 
Zero dividend factor wf 0.60613 1.88 20 0.00184 0.83 
Sales /price 24 0.00416 2.98 21 0.00072 0.67 
Next to last month’s excess return 12 — 0.00195 ~ 1.46 22 — 0.00846 — 0.63 
Acceleration of book value /price 15 0.00017 0.19 23 0.00097 0.60 
Earnings change /price 20 0.00094 0.96 24 0.00092 0.53 
Dividend /price 30 0.00233 2.50 25 0.00033 0.38 
Low price 23 — 0.00719 — 3.04 26 — 0.00068 - 0.34 
Excess return variable 2 ~ 0.00384 — 3.99 27 0.00025 0.19 
Acceleration of earnings/price 16 0.00263 3.25 28 0.00020 0.18 
Par variable 5 — 0.00370 — 0.02 29 — 0.00042 — 0.09 
Dividend /earnings 9 0.00033 0.22 30 — 0.00012 — 0.07 


Source: Ziemba (1990), Ziemba and Schwartz (1991). 
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Table 3 
Returns from use of model in Table 2 with yearly re-estimations 
Portfolio Value of ¥1 Annualized Monthly mean St. dev. of 
invested in Jan 85 return in % return, monthly returns 
at end of Sept 89 % 
Best 50 757 53.1 3.76 5.57 
Best 100 7.12 51.2 3.64 5.36 
Best 200 6.54 48.5 3.48 5.17 
Best 300 6.16 46.6 3.37 §.13 
Best 400 5.84 45.0 3.26 5.02 
Best 500 5.54 43.4 3.16 4.88 
Equally weighted 4.44 36.8 2.80 4.17 
TSE-I 
Nikkei 3.09 26.8 2.09 4.42 
TOPIX 2.96 25.7 2.04 4.96 


Source: Ziemba (1990), Ziemba and Schwartz (1991). 


reversals for stocks with long term tax losses and 
high returns to stocks with either high yields or 
no dividends at all. The coefficients estimate the 
mean returns for a one standard deviation tilt in 
the direction of the given factor. 

Table 2 from Ziemba and Schwartz (1991) 
based on Ziemba (1990) shows similar results for 
Japan for the period June 1979 to August 1989 
using monthly data for the first section of the 
Tokyo stock exchange. The multivariate and uni- 
variate effects of the variables with f-values are 
shown here. 

The most important factors explaining Japa- 
nese stock returns in the 1980s were similar to 
those in US markets, namely earnings growth 
projections, mean reversion variables, low price 
to book and to earnings and low capitalization. 

The fact that anomalies exist is born out by the 
fact that results similar to those in Tables 1 and 2 
form the basis of proprietary investment manage- 
ment models used to trade securities and create 
profitable hedges. For example, models using ele- 
ments similar to the results in Tables 1 and 2 are 
used in proprietary trading by Jacobs and Levy 
Investment Management and Buchanan Partners, 
Ltd, London, England, respectively. Several firms, 
most notably BARRA, in Berkeley, California, esti- 
mate and lease factor models for various coun- 
tries. The hope is that by using such models, 
superior returns will be obtained. Table 3 and 
Figure 1 provide the results of using the model 
defined in Table 2 readjusted year by year from 
1985 to September 1989 to rank the best stocks in 
Japan. For example, for stocks in 1987 the model 
was estimated using data up to December 1986. 


The results indicate that the portfolios of the 
highest ranking stocks have returns significantly 
higher than the portfolios of the Topix and Nikkei 
Stock Average market indices. Topix includes all 
Tokyo stock exchange section one stocks value 
weighted and the NSA is the price weighted 
average of 225 large companies. Because of its 
value weighting, the Topix is a larger capitalized 
index. Recent experience shows that models of 
this type have been quite effective even in the 
steep 1990-94 declines in the Japanese stock 
market. ' Essentially all the variable factors used 
in Tables 1 and 2 have been extensively studied 
by previous researchers in isolation or in small 
groups. 

An important and controversial research area 
is that of mean reversion and the time period of 
the reversion. Key references are Poterba and 
Summers (1988), Kim, Nelson and Startz (1991), 
DeBondt and Thaler (1985, 1987, 1989) and 
Thaler (1992). Some of this research is discussed 
below. The essence of the research is that securi- 
ties that have had very high or very low short or 
long term returns tend to have lower or higher 
returns than average, respectively, in succeeding 
time intervals. 


The small firm and January effects 


The size effect measured by market capitaliza- 
tion has been of particular interest. Key refer- 


1 The 1990-92 decline in Japanese stock and speculative land 
prices and their relationships is studied along with the past 
behavior of these markets in Stone and Ziemba (1993). 
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Excess returns of smallest market capitalization decile stocks over decile 2 to 10 stocks on NYSE by month, 1926 to 1987 


DeeNe Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Total (Jan—Dec) 
2 1.16 016 —0.05 —-0.40 0.05 0.00 0.00 0.09 0.04 0.06 = 0.32 0.20 1.52 
St 34 21 24 35 32 30 30 29 33 41 38 
3 1.46 0.09 0.02 -0.34 -0.21 —0.13 012 -0.18 0.16 —0.29 0.33 0.03 1.15 
46 38 29 24 25 28 30 32 32 30 41 31 
4 1.85 0.36 -0.11 -0.09 -0.05 -0.13 0.21 0.22 0.38 0.47 0.56 0.06 2.80 
49 31 32 27 30 29 31 37 34 21 39 29 
5 2.89 0.33 -0.22 -0.42 -0.16 -0.16 0.25 0.30 0.13 -0.60 0.39 0.16 2.89 
49 36 28 26 26 28 29 34 28 28 33 31 
6 3.43 0.29 -030 -0.36 0.09 -0.47 0.16 0.36 0.22 -0.90 0.58 -0.04 2.94 
53 36 31 25 25 24 32 31 31 28 35 30 
ri 4.11 0.60 -0.23 -0.23 0.28 -0.46 0.27 0.04 015 - 0.65 0.30 0.44 3.68 
52 40 34 20 28 23 31 32 33 27 31 25 
8 5.42 0.69 -0.47 —0.46 0.21 —0.69 035 -0.05 -0.23 -0.83 0.39 -0.65 3.69 
S1 40 29 24 25 24 32 26 32 29 32 28 
9 7.06 0.87 -0.06 —0.40 0.20 -0.71 0.45 0.13 —0.00 -0.91 0.22- —1.07 5.67 
56 36 34 25 27. 24 33 29 29 27 30 24 
10 10.27 0.94 -0.94 -0.30 0.37 —0.64 0.69 —0.07 0.03 -1.23 -0.27 —1.26 8.40 
57 32 26 23 29 24 32 26 30 24 23 24 


Source: Ibbotson and Associates (1988). 


ences for US studies are Banz (1981), Reinganum 
(1981), Roll (1983), Blume and Stambaugh (1983), 
Keim (1983), Jaffe, Keim and Westerfield (1989), 
Reinganum (1990), Jegadeesh (1992) and Dyl and 
Maberly (1992). Table 4 from Ibbotson and Asso- 
ciates shows the relative returns by month of 
various decile NYSE stocks from 1926 to 1987. 
While there is a substantial size effect (smallest 
minus largest decile returns) of 8.40% per year 
over these 62 years, the whole gain is in January 
which returned 10.27%. Indeed, the small firm 
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effect is strong in many years (for example 1962- 
1983, 1990-92) and very weak in others (for ex- 
ample 1983-85, 1988-90). However, the January 
excess returns of small over large capitalized 
stocks are positive in almost every year (also as 
discussed below small stocks have greatly outpre- 
formed during democratic administrations as 
shown by Hensel and Ziemba (1993b)). Table 4 
indicates that the smallest decile stocks had higher 
returns than the largest decile stocks in 57 of the 
62 years (and the last seven from 1988-94) 


8712 8806 8812 8906 8912 


Figure 1. Cumulative performance of ¥ 1 invested in the best 50, 100, 200, 300, 400, 500 and all equally weighted TSE-I stocks 
versus the Topix and NSA225, monthly revisions of data and modcl, January 1985 to September 1989. Source: Ziemba (1990), 
Ziemba and Schwartz (1991) 
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Table 5 0.0150 
Mean return difference between equally and value weighted 0.9125 
indices omon 
0.0075 
Last day January trading day 0.0050 
of during 1963-1978 0.0025 
December First Second Third Fourth 0.0000 
- neo 
Mean return 0.5647 1.186 0.6067 0.6107 0.4527 eee 10-9 -8 -7 6 -5 -4-3 22-1123 45678 9 
t-statistic (4.72) ' (8.39) (3.86) (3.96) (3.05) 


a 


t-statistics in ( ). 
Source: Roll (1983). 


whereas the outperformance in other months is 
not statistically significant and indeed there is no 
outperformance at all. A figure showing the more 
recent years appears in Ziemba (1994). Table 5 
shows the small stock premium measured as the 
difference between equally and value weighted 
indices of all NYSE stocks for 1963-78 for trad- 
ing days — 1 to +4. The mean difference in these 
five days was over 3% during this 16 year period. 

Figures 2 and 3 supplement this and show that 
for the period 1971-85 small stock returns greatly 
exceeded large stock returns on trading days — 1 
to +9 in January and much of the explanation 
for this difference (R? = 0.48) is the difference in 


H january O Other Months 


Figure 2. Excess returns on small stocks, mean daily percent 
returns, smallest minus largest decile, 1971-85. Source: Ritter 
(1988) 


the buy-sell ratios of individual investors. * There 
is no such effect in any other month. A survey of 
the small firm/January effect literature along 
with methods to invest in this regularity in the 
futurcs markets since 1982 appears in Clark and 
Ziemba (1987) which was updated to 1993-94 in 
Zicmba (1994b). They document the success of a 
strategy of going long the small capitalized stocks 


? Asaji Komatsu, of the Yamaichi Research Institute in Tokyo, 
and the author investigated the explanatory power of the 
buy-sell ratios of individual investors on Japanese security 
returns. This variable explained about 11% of the variation 
in the excess returns of small versus large capitalized stocks 
using data from the 1970s and 1980s. 


Table 6 
International evidence of seasonality in the size premium 
Country 
Belgium Finland France Japan Taiwan UK 
Test period 1969-1983 1970-1983 1968-1980 1965-1987 1979-1986 1958-1982 
Number of securities 170 40 201 First section 53 to 72 All LSE 
TSE 
Number of size portfolios 5 5 5 10 5 10 
Market value largest portfolio 
5 SEE 188 113 83 NA 17 182 
Market value smallest portfolio 
January return (%) on: 
Smallest portfolio 5.4 5.9 3.7 9.5 NA 2.3 
Largest portfolio 3.0 2:5 44 2.3 NA 3.6 
January size-premium 2.4 3.4 —0.7 7.2 3.4 213. 
Rest-of-the-year monthly return (%) on: 
Smallest portfolio 0.8 1.8 1.4 1.9 NA 1.2 
Largest portfolio 0.4 1.0 0.7 1.3 NA 0.7 
Rest-of-the-year size-premium (%) 0.4 0.8 0.7 0.6 0.3 0.5 


TSE = Tokyo Stock Exchange; [SE = London Stock Exchange; NA = Not Available. 

All monthly mean returns are significantly different from zero at the 5% level. 

The January size-premium is significantly different from zero only in Belgium, Finland, Tatwan and Japan. 
Monthly mean returns are not significantly different from zero for the largest portfolio. 


Source: Hawawini and Keim (1994) from original data. 
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Figure 3. Mean buy-sell ratio of individual investors at Mer- 
rill Lynch, 1971-85. Source: Ritter (1988) 


and short the large caps using the Value Line and 
S&P500 futures contracts. While the futures 
market seems to anticipate the effect in mid to 
late December, the cash effect has remained very 
similar to that in previous years and has not been 
anticipated by those knowledgeable in trading 
this effect in the futures markets. Indeed the 
years 1990/91 to 1993/94 have produced very 
strong effects with 1992/93 the largest in the past 
two decades and 1993/94 nearly as strong. This is 
consistent with the evidence in Hensel and 
Ziemba (1993b) that the small capitalized stocks 
have outpreformed large capitalized stocks when 
democrats are in the White House. The advan- 
tage, which seems to hinge on the different ap- 
proach of Democrats and Republicans to small 
and large businesses, has been greatest in the 
first year of administrations. This small minus 
large stock advantage during democratic adminis- 
trations averaged 15.12% in the first year, 8.25% 
in the first two years and 7.87% over the four 
years from 1937-1992 and even more during the 
period 1929-1992. 

The small firm effect and its relationship to 
the January effect has been studied in numerous 
countries. Table 6 from Hawawini and Keim 
(1994) shows results from Belgium, Finland, 
France, Japan, Taiwan and the United Kingdom. 
The January size premium was significantly dif- 
ferent from zero in Belgium, Finland, Taiwan and 
Japan. However, there was no size premium in 
France and the United Kingdom. The data used 
for these studies while fairly extensive ends at the 
beginning of the 1980s. In this table all monthly 
returns are significantly different from zero ex- 
cept the largest portfolio outside of January. Fig- 
ure 4 shows the January size premium in twenty 
one countries during 1991 and 1992. There was a 
January size premium in the US in these years 
but this was not world wide. 


Using data up to 1989 indicates that the size 
effect in Japan seems to occur twice each year 
with each excess of small over large stock returns 
occurring at the time of the semi-annual Decem- 
ber and June bonuses. Each of these bonuses 
amount to about one sixth of a typical worker’s 
annual salary and some of these funds may well 
price pressure small capitalized stock because the 
increased buying of stocks with low volume in- 
creases their prices. The evidence in Ziemba and 
Schwartz (1991) and Kato (1991) is that the size 
effect on the first section of the Tokyo stock 
exchange from 1969 to 1987 was 8.6% in January 
and February and 8.2% in April to July. For 
anomalies to persist, the reason for them such as 
increased cash flows into stocks during various 
periods must continue. During the steep decline 
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Figure 4. Difference between small and large cap stocks in 
local currency in January, 1991 and 1992. Source: Morgan 
Stanley data 
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and high volatility in the Japanese stock market 
from 1990 to 1993 individual investors have been 
simply afraid to buy stocks. Their cash flows have 
gone into bounds and cash instruments not into 
stocks. Indeed Bell and Ziemba (1993) found that 
the only anomaly that was still present in Japan 
in the this period was the fact that all gains in the 
stock market are at night in the close to open 
returns. The absence of anomalies has modified 
the behavior of Japanese security markets. 


Low price to earnings, sales, book value and cash 
flow effects 


Low price to earnings stocks are natural 
choices for investors since the major reason to be 
in business is to generate earnings. Early propo- 
nents of such strategies were the famous value 
investors Graham and Dodd (1934). Studies by 
Nicholson (1960), Basu (1977, 1983), Reinganum 
(1981), Peavy and Goodman (1983), Jaffe, Keim 
and Westerfield (1990), Fama and French (1991) 
in the US, Levis (1990) in the UK, Aggarwal, Rao 
and Hiraki (1988) and Ziemba and Schwartz 
(1991) for Japan, Chou and Johnson (1990) and 
Ma and Shaw (1990) for Taiwan, and Gillan 
(1990) for New Zealand find significant price 
earnings effects. The highest return strategies are 
usually these that invest in stocks with low price 
earnings ratios relative to their own historical 
values and those of stocks similar industries. 
Strategies that attempt to eliminate general mar- 
ket risk by going long such stocks and short the 


Table 7 


high PER stocks generally have high returns and 
low risk. Peavy and Goodman (1983) found this 
to be 20.8% per year during 1970-80. Hawawini 
and Keim (1994) using data from 1962 to 1989 
show that although the lowest price earning ratio 
stocks have the highest average returns in total or 
risk adjusted using the capital asset pricing model 
securities with negative earnings have even higher 
average returns; see Table 7. But these returns 
have considerable risk. For example, negative 
earnings stocks have average betas of 1.46 versus 
1.01 for the lowest price earnings ratio stocks. 
These stocks also have low prices and low capital- 
ization. After risk adjustment they still have very 
high returns but they are significantly below the 
lowest decile price earnings ratio stocks. Negative 
earnings stocks are also highly correlated with 
small capitalized effects which again brings back 
the advantage of analyses like Tables 1 and 2. See 
also Jaffe, Keim and Westerfield (1989). 
Ettredge and Fuller (1991) investigated this 
further concentrating on the risk aspects. Using 
data from 1973-82 on the NYSE and ASE they 
find that stocks with negative earnings on April 1 
and held for one year have mean returns that 
average 11.8% above the market index. Large 
earnings gains, which average 34.8%, seem to be 
the reason behind the increase in stock prices. 
The 11.8% excess gain can be decomposed into a 
7.7% negative earnings effect and a 4.1% size 
effect. Repeat losers have even higher mean re- 
turns than first time losers. They had mean excess 
returns of 19.5% above the market index. The 
effects of firms and second time losers seems to 


Mean monthly returns and other attributes for the NYSE and AMEX from April 1962 to December 1989 using portfolios formed 
on the basis of the ratio of price /earnings created on March 31 and revised after each twelve months 


P/B Mean Std. dev. Beta Mkt. cap. P/B Price ($) 

portfolio return return ($mil) 

Negative 1.55 8.68 1.46 141.8 —37.7 10.57 

Lowest 0.79 5.61 1.09 888.4 27 37.55 
2 0.90 5.06 1.05 938.4 5.0 38.02 
3 0.91 4.94 1.03 849.1 6.4 35.38 
4 0.87 4.60 0.95 913.9 7.5 31.91 
S 0.79 4.81 0.98 913.5 8.5 31.22 
6 0.97 4.65 0.94 808.0 9.4 29.68 
7 1.05 4.69 0.96 682.3 10.5 28.11 
8 1.13 4.64 0.87 777.8 11.7 27.47 
9 1.34 4.65 0.89 656.5 13.3 25.07 

Highest 1.25 5.48 1.01 569.8 18.9 21.82 


Source: Hawawini and Keim (1994). 
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Table 8 


Mean monthly returns and other attributes for the NYSE and AMEX from April 1962 to December 1989 using portfolios formed 
on the basis of the ratio of price /book created on March 31 and revised after each twelve months 


P/B Mean Std. dev. Beta Mkt. cap. P/B Price ($) 

portfolio return return ($mil) 

Negative 1.66 8.37 1.29 119 -7.07 13.62 

Lowest 1.49 5.63 1.04 261 0.57 15.98 
2 1.46 4.98 0.95 401 0.83 19.99 
3 1.08 4.66 0.90 620 1.00 23.56 
4 1.12 4.32 0.83 668 1.15 25.72 
5 0.96 4.56 0.90 641 1.31 26.94 
6 0.82 4.45 0.91 835 1.52 29.97 
7 0.86 4.79 0.98 752 1.80 31.45 
8 0.93 4.87 1.02 813 2.20 33.93 
9 0.85 5.25 1.11 1001 2.95 37.48 

Highest 0.91 5.19 1.05 1430 8.17 46.66 


Source: Hawawini and Keim (1994). 


be caused by bad market sentiment and their 
subsequent bounce and lower estimates of earn- 
ings then reality. 

Evidence of a low price to sales effect has 
been found for the US market by Senchack and 
Martin (1987) and for Japan by Aggarwal, Rao 
and Hiraki (1989). Hawawini and Keim (1994) 
present evidence on price to cash flow (earnings 
plus depreciation). Finally several authors includ- 
ing Stattman (1980), Rosenberg, Reid and 
Lanstein (1985), Keim (1988), Fama and French 
(1991) and Hawawini and Keim (1994) have found 
statistically significant evidence of a low price to 
book value per share effect. Hawawini and Keim, 
using data from April 1962 to December 1989, 
found the results in Table 8. There was a strong 
price to book effect in this period with the lowest 
P/B stocks returning 1.66% per month with lower 
beta versus 0.85% for the highest P/B stocks. As 
with the low price earnings and low price to cash 
flow, see tables in Hawawini and Keim (1994), 
negative price to book equities have the highest 
returns. This latter is an example of the winner- 
loser effect studied by DeBondt and Thaler (1985, 
1987); see also Ball and Kothari (1989), Fama and 
French (1988a), Chan (1988), Zarowin (1989, 
1990), Thaler (1992), and Chopra, Lakonishok 
and Ritter (1992). There is considerable evidence 
that US stocks that have had very poor perfor- 
mance over a long period, such as five years, have 
much higher returns in subsequent years, espe- 
cially in January. However, there seems to be a 
difference between extreme winners and losers 
over one versus five year periods. While the evi- 


dence points strongly to reversals for the latter, 
the former seem to continue their losing or win- 
ning trends, see Jegadeesh and Titman (1991). 

Similar results do not seem to occur in Canada. 
Kryzanowski and Zhang (1992) found that the 
continuation hypothesis holds for the following 
one and two years so that losers continue to lose 
and winners win. But there is no reversal over 
subsequent five to ten year periods. 

Figure 5 shows the effect using monthly return 
data for all NYSE stocks from 1926 to 1982. 
Porfolios of the 35 most extreme losers and win- 
ners were formed over five year periods. These 
portfolios returns were evaluated for the follow- 
ing five years. This was done year by year as well 
as through an averaging process. The returns for 
both winners and losers are mean reverting. The 
subsequent outperformance of the losers is much 
greater than the subsequent underperformance 
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Months After Portfolio Formation 


Figure 5. Cumulative excess returns for winner and loser 
portfolios. Source: De Bondt and Thaler (1985) 
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for the winners. One can see the sharp upward 
return movements in the five Januarys on the 
graph. Differences in beta risks do not explain 
the phenomenon. The losers are, on average, 
smaller firms and also have low price and low 
price to earnings, etc. Space limitations prohibit a 
full discussion of the interaction of the various 
effects and also their relationship with seasonality 
effects which I now turn to. On this see Jaffe, 
Keim and Westerfield (1989), Fama and French 
(1991), Hawawini and Keim (1994) and the re- 
sults in Table 1 and 2. See also Reinganum (1988) 
who has studied factors that lead to extremely 
higher medium term returns for specific stocks. 


Seasonality effects 


There is considerable research on seasonality 
patterns in US, Japan and other world wide secu- 
rity markets. The focus has been on day of the 
week, holiday, turn of the month and year and 
related effects. As with the fundamental vari- 
ables, the effects are interrelated. Hence, to un- 
derstand them better, isolated analysis plus joint 
effects research is suggested. In the case of fun- 


damental variables, one can estimate, as in Ta- 
bles 1 and 2, multivariate equations to attempt to 
use the various effects to improve on investment 
predictions concerning individual securities. For 
the seasonal effects, one may construct seasonal- 
ity calendars that attempt to improve on invest- 
ment prediction for market indices such as small 
and large capitalized. There is some evidence 
that taken together such calendars can provide 
forecasts that separate out the days of the year 
into the most and least favorable for stocks with 
intermediate gradations. Ziemba and Schwartz 
(1991) have attempted to do this for Japan follow- 
ing on the work in Ziemba (1988, 1994a). A 
fifty-two variable model was developed to at- 
tempt to capture day of the week, holiday, Jan- 
uary, turn of the year, turn of the month, monthly, 
Golden Week, and the small firm effect. The 
t-statistics are very high leading to a rejection of 
the hypothesis that the days ranked +1 to +4 
(i.e., one standard deviation above average or 
more) have lower mean returns, than the days 
ranked —1! to —4. The model was estimated in 
several ways for the years 1984, 1985,..., 1989 
using data for 5, 10, 20 or all the data back to 
1949, using all coefficients or all coefficients that 


Table 9 
Summary of the seasonality calendar results, January 1988 to end of January 1989 
Rank Sample Mean daily 
size returnNSA225 

-4 14 0.213 
-3 19 — 0.129 
SA 6 0.281 
=} 7 0.104 

0 27 -0.111 

1 79 0.132 

2 71 0.067 

3 30 0.154 

4 40 0.478 

Categories to be compared: 

4 3 70 0.339 
-4 -3 33 0.016 

2 l 150 0.102 
2 el 13 0.186 

4 3 70 10.339 

2 1 150 0.402 
=2, 4 13 0.186 
-4 -3 33 0.016 

4 3 2 1 220 0.177 
S43) S20 a. 46 0.064 


Mean daily St. dev. St. dev. 
return Topix on NSA on Topix 
0.166 1.455 0.409 
— 0.080 0.517 0.538 
0,240 0.668 0.732 
0.002 0.781 0.834 
—O.117 0.675 0.687 
0.081 0.627 0.659 
0.085 0.650 0.676 
0.149 0.610 0.595 
0.512 1.060 1.224 
0.356 0.904 1.014 
0.025 0.514 ().496 
0.083 0.637 1.665 
0.112 0.706 0.765 
0.356 0.904 1.014 
0.083 0.637 0.665 
0.112 0.706 0.765 
0.025 0.514 0.496 
0.170 0.738 0.801 
0.049 0.572 0.577 


Source: Ziemba and Schwartz (1991). 
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Table 10 
International evidence of daily seasonality in stock market returns in percent 
Country Period Index > Mon Tues Wed Thurs Fri Sat 
Australia 1975-84 EW1 0.044 —0.116 ** 0.450 0.198 * 0.157 * 
Belgium 1977-85 vwl 0.098 * — 0.032 0.041 * 0.111 * 0.130 * 
Belgium 1977-85 EWI 0.080 * 0.026 * 0.046 * 0.044 * 0.062 * 
Canada 1969-84 TC — 0.157 * — 0.003 0.073 * 0.075 * 0.094 * 
Finland 1977-82 VWI 0.086 * 0.066 * 0.030 0.070 * 0.074 * 
France 1977-89 vwl — 0.050 0.139 * 0.126 * 0.133 * 0.100 * 
France 1977-89 EWI 0.083 * 0.117 * 0.122 * 0.117 * 0.013 
Germany 1970-85 Vwi — 0.085 * 0.008 0.044 0.060 * 0.099 * 
Germany 1970-85 EWI 0.005 — 0.026 0.031 0.066 0.107 * 
Greece 1978-86 Vwi 0.070 * — 0.058 — 0.073 * 0.041 0.160 * 
Japan 1949-88 vwi — 0,071 * —0.044 * 0.115 * 0.081 * 0.042 * 0.133 * 
Korea 1980-84 Vwi —0.072 * — 0.087 * 0.087 * 0.014 * 0.120 * 0.230 * 
Singapore 1969-84 ST — 0.036 — 0.107 * 0.079 * 0.121 * 0.100 * 
Spain 1979-83 VWI — 0.072 * 0.003 0.037 0.071 * 
UK 1969-84 FT-A — 0.095 * 0.106 * 0.090 * 0.011 0.044 
US 1928-52 S&P —0.223 * 0.076 * 0.084 * 0.066 0.029 0.147 * 
US 1952-82 S&P — 0.154 * 0.026 0.103 * 0.036 * 0.092 * 


è * Indicates returns that are significantly different from zero at the 5% level, VMI and IWI indicate value and equally weighted 


indices, respectively. 


> TC = Toronto composite index. The Tokyo stock exchange is closed on Saturdays since February 1989. ST = Straits Times index. 


FT-A = Financial Times all share index. 
Source: Hawawani and Keim (1994). 


were statistically significant at the 20% or 5% 
level. In all cases the higher ranked days such as 
those before holidays, in January, at the turn of 
the month, etc. had higher returns than those in 
the second half of the month, on Mondays or in 
October. The results for January 1988 to the end 
of January 1989 appear in Table 9 using all the 
data and all the coefficients. The hypothesis that 
the +3 and +4 days have returns higher than the 
—3 and —4 days is significant for the NSA at the 
94% level and for the Topix at the 92% level. 


Generally speaking the results are as expected 
but standard deviations are high and many hy- 
potheses are not significant at conventional levels 
for this short period. 

There is strong evidence that stock prices fall 
on Mondays and rise on other days. Table 10 
from Hawawini and Keim (1994) shows the evi- 
dence from 13 countries. An interesting observa- 
tion is that all the 1929-32 stock market decline 
some 87% from peak to valley occurred on week- 
ends and on the following Monday. There were 


Table 11 
International evidence of seasonality in stock market returns, monthly mean returns, 1959-1979 
Country Jan Feb Mar Apr May Jun July Aug Sept Oct Nov Dec All 
Australia 2.65 —0.58 0.51 0.84 0.97 0.43 0.66 -037 —2.39 2.13 — 0.85 3.99 0.67 
Belgium 3.20 1.09 0.40 1.48 — 1.36 —0.84 144 —-1.17 =187 -0.69 0.42 -0.09 0.17 
Canada 2.90 0.07 0.79 0.41 —0.9  —0.30 0.69 0.60 —0.06 — 0.82 1.44 2.61 0.61 
Finland 3.62 2.09 2.63 0.95 —0.02 1.55 2.53 0.97 -0.76 0.77 0.61 2.51 0.45 
France 3.72 —0.18 1.98 0.94 -0.66 -1.90 1.53 1.03 -1.21 - 0.72 0.43 0.15 1.43 
Japan 3.53 1.13 1.88 0.30 0.90 2.06 -0.32 -0.83 -0.13 -0.98 1.65 1.80 0.92 
Spain 2.24 1.29 0.32 1.59 — 1.87 0.06 0.79 1.29 -—1.64 0.19 -044 -0.01 0.32 
UK 3.40 0.69 1.25 3.13 SEA SEI SLN 1.88 0.24 0.80 -0.61 2.06 0.85 
UK FT-A 3.06 0.79 1.15 3.57 —1.00 ~—0.85 —0.22 2.62 0.03 1.26 0.16 2.34 1.08 
US 1.04 —-0.41 1.27 0.96 -1.38 -0.56 0.14 0.34 -0.79 0.78 1.03 1.42 0.32 
NYSE, 5.08 0.55 1.55 044 -142 -1.00 0.73 0.72 -042 -0.79 1.79 1.37 0.72 
equally 
weighted 


Source: Gultekin and Gultekin (1983). 


Chapter 18: Worldwide Security Market Regularities 413 


210 W.T. Ziemba / World wide security market regularities 


gains on average from Monday’s close to Friday’s 
close. Monday’s decline seems to have an influ- 
ence from the existence of a two day trading 
break. Table 12 indicates that along with holi- 
days, returns are high before the break in trading 
and low after it. Ziemba (1993) investigated the 
difference between one and two day weekends in 
Japan that is with or without Saturday trading. 
Returns are negative on Mondays after two day 
breaks and negative on Tuesdays after one day 
breaks. Accumulation of sell orders by individual 
investors when they have time to formulate them 
on weekends is offered as the cause for this 
effect. Osborne (1962) and Miller (1988) argued 
this for US markets as well. Lakonishok and 
Maberly (1990) have provided a test with NYSE 
odd-lot sales and purchases, sales and purchases 
of cash-account customers of Merrill-Lynch and 
NYSE block transactions. They found that Mon- 
day had the lowest trading volume. Institutional 
trading is the lowest on Monday of all trading 
days but individual trading on Monday is the 
highest relative to other days of the week. Indi- 
viduals tend to sell more on Mondays. For exam- 
ple, odd-lot sales minus odd lot purchases rela- 
tive to NYSE volume was 29% higher during 
1962-86 on Monday than the average of Tuesday 
to Friday. Theoretical arrangements that lead to 
imbalances on different days in trading volume, 
mean returns and volatility based on the interac- 
tion of various traders have been made by Ad- 
mati and Pfleider (1988, 1989). 

Interestingly as soon as Saturday trading ended 
in Japan the end of January 1989, the day of the 
week effect became like the US with larger de- 
clines on Mondays and rises on Tuesdays. See 
Table 9 and Ziemba and Schwartz (1991). 

Stock prices tend to rise in January and the 
early part of the year and then at the end of the 
year with May and June and especially Septem- 
ber and October being weak months. Table 11 
from Gultekin and Gultekin (1983) shows the 
effect in ten major stock markets for the twenty 
one years from 1959 to 1979. 


Holiday effects 


There has been a very strong holiday effect in 
US markets throughout the twentieth century. 
Ariel (1984, 1990), Zweig (1986), Lakonishok and 


Smidt (1988) have documented this. For example, 
for the ninety years from 1897 to 1986, Lakon- 
ishok and Smidt found that fully 51.5% of the 
non-dividend returns on the Dow Jones industri- 
als were made on the approximately eight prehol- 
idays. Figure 6a shows the mean returns on the 
eight preholidays for the S & P500 from 1980-92. 
Ariel using data from 1963-82 found a very strong 
effect with the average preholiday having returns 
that were about 23 times an average day for large 
capitalized stocks measured by a value weighted 
index of all NYSE stocks. Small capitalized stocks 
(equally weighted NYSE stocks) had returns 14 
times larger but since this period was one of 
extremely high small stock returns, the actual 
returns exceeded the large capitalized securities. 
Lakonishok and Smidt also found that preholi- 
days are associated with higher mean returns on 
all days of the week compared to average returns 
those days. Investigation of the holiday effect in 
Japan by Ziemba (1991) yields very similar re- 
sults. Using daily data on the Nikkei stock aver- 
age from May 1949 when the market opened up 
after World War II to 1988, he found that the 
typical preholiday had returns of about five times 
the average non-preholiday trading day, namely 
0.246% versus 0.0489%. 

A regression to separate out the effects on 
trading days —3 to +2 around a holiday using 
0-1 variables led to the daily return 


R = 0.0352 + 0.0799 day_, + 0.0222 day_, 


(3.745) (1.491) (0.424) 
+ 0.1894 Day_,— 0.0663 Day, , 
(3.709) (- 1.334) 
+ 0.00114 Day ,,. 
(0.023) 


The only day whose return effect is statistically 
different from zero was the preholiday. The day 
after holidays has negative returns which are sig- 
nificant at low confidence levels. Investigation of 
the relationship between the holiday and day of 
the week effects led to the results in the tables. 
Preholidays when lumped with pseudo-preholi- 
days namely Fridays and Saturdays preceding 
market closings have high returns and a 10.27 
t-statistic. With this additional data, the post holi- 
days and Mondays have statistically significant 
negative returns, see Table 12. The preholidays 
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Figure 6(a). Mean percentage returns on the preholidays for the S & P500, 1980-92. Source: The Economist (1992) (b) Holiday 
effects in the UK, mean daily percent returns. 


had higher returns on every day of the week as 
shown in Table 13. Moreover, the returns are 
consistently positive in months with high mean 
returns in Japan such as January but not so for 
the lower mean months of September and Octo- 
ber; see Table 14. Finally an investigation of the 
futures markets effects by Ziemba (1989) led to 
the conclusion that the futures markets during 
1986-88 anticipated the holiday effect on the 
third trading day before the holiday and this 


anticipation led to price rises on the —3 day to 
take some of the expected preholiday gains. 

The period in late April to early May is re- 
ferred to as Golden Week in Japan. There were 
holidays on April 29 and May 3 and 5 during the 
sample period data in Ziemba (1991). The prehol- 
idays all had very high mean returns: 0.215% on 
April 28, 0.536% on May 2 and 0.326% on May 4. 

The holiday effect seems to be similar in the 
UK as well. Table 15 and Figure 6b show results 


Table 12 
Holiday effects on the NSA, 1949-1988 
Day Sample Minimum Maximum Mean Standard t- 
size return, % return, % return, % deviation, % statistic 

Pre-holiday 408 —5.41 3.06 0.2461 0.79 6.26 
Non pre-holiday 7143 — 14.90 11.29 0.0480 0.98 4.22 
Days after holidays 408 — 8.69 4.71 0.0068 0.99 0.14 
Pre-holidays, 2268 — 6.61 6.88 0.1561 0.72 10.27 

Fridays and Saturdays 

preceding market closing 
Days after holidays 2268 — 8.69 5.39 — 0.0480 * 0.95 — 2.41 


and weekends 


Source: Ziemba (1991). 
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Table 13 
Mean daily return, standard deviation and percent positive returns on pre-holidays by days of the week of the NSA, 1949-1988 
Sample Mean t- Positive 
size return, % statistic returns, % 
All Mondays 1937 — 0.0589 -2.73 50.9 
All Tuesdays 1959 — 0.0419 — 1.93 47.4 
All Wednesdays 1960 0.1164 5.30 56.9 
All Thursdays 1959 0.0871 3.77 55.1 
All Fridays 1959 0.0544 2.62 54.6 
All Saturdays 1755 0.1409 8.38 61.9 
Mon. before holidays 59 0.3106 3.10 76.3 
Tues. before holidays 59 0.0103 0.10 57.6 
Weds. before holidays 58 0.1600 1.17 69.0 
Thurs. before holidays 60 0.1868 2.82 733 
Fri. before holidays 71 0.3821 4.05 74.6 
Sat. before holidays 101 0.2758 4.44 64.4 
1° 256 0.2097 4.58 62.5 
2 1703 0.0310 1.36 53.4 
3 18 0.2648 1.38 77.8 
4 56 0.1895 * 2.53 66.1 


a | = Fridays before a no-trading Saturday; 2 = Fridays before a trading Saturday; 3 = Fridays before a no-trading Saturday and 
no-trading Monday; 4 = Saturdays before a no-trading Monday and Saturdays before a Monday holiday. 
Source: Ziemba (1991). 


from the cash and futures markets from 1984 to 
1992. The preholidays have returns that average 
about five times a typical day. As in the US and 
Japan, there is some anticipation in the futures 


The turn of the month effect 


There is strong evidence for a turn of the 
month effect in US, Japanese and other security 


markets that tends to shift the return distribu- 
tions. Arbitrage trading is less active in the UK 
than in the US and this may be the reason that 
the effect is less pronounced in the UK. As in the 
other countries, the after holidays combined with 
Mondays are negative. 


markets. Market advisors such as Merrill (1966), 
Fosback (1976) and Hirsch (1986) have argued 
that stocks advance at the turn of the month. The 
five days consisting of the last trading day of the 
previous month (—1) and the first four trading 
days of the new month (+1 to +4) are referred 


Table 14 
Mean daily return, standard deviation on pre-holidays, NSA, 1949-1988 
Holiday Advances Mean Minimum Maximum t- 
return, % return, % return, % statistic 
January 1 32/39 0.2457 — 3.38 1.27 2.01 
January 15 32/39 0.4711 — 2.04 1.95 4.11 
February 11 15722 0.3011 — 0.42 1.81 2.65 
March 21 21/39 0.0687 =150 1.91 0.69 
April 29 25/39 0.2272 — 2.96 2.73 1.71 
May 3 31/39 0.4989 — 0.74 3.06 4.65 
May 5 30/39 0.3884 — 0.66 2.19 3.92 
September 15 14/22 0.0154 = 2.48 0.67 0.10 
September 23 27/39 0.2341 — 0.92 1.31 3.00 
October 10 12/22 — 0.2487 —541 1.26 —0.81 
November 3 24/39 0.2987 -1.27 2.77 2.15 
November 23 25/39 0.2414 -2.18 2.59 1.55 
December 23 


Source: Ziemba (1991). 
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Table 15 
Holiday effects in the UK 


Day * Cash market Futures market 
Data period: May 4, 1984 to May 10, 1992 Data period: May 4, 1984 to July 17, 1992 
Sample Average t-stat Sample Average t-stat 
size return % size return % 
PH 49 0.198 1.912 50 0.182 1.097 
AH 49 0.044 0.365 50 0.103 0.573 
AH&M 427 —0.095 — 1.861 437 — 0.078 — 1.130 
PH&FRI 428 0.144 3.544 438 0.079 1.477 
Other 1928 0.410 1.911 1975 0.031 1.190 
? PH = preholiday (- 1); 
AH = after holiday (+1); 
AH&M = Monday; 
PH&FRI = Friday; 
Other = rest of days. 


to as the turn of the month. Ariel (1987) docu- 
mented this for small land large capitalized stocks 
measured by equally and value weightings, re- 
spectively, of all NYSE stocks from 1963 to 1981, 
see Figure 7. See Table 5 for the January returns 
on these days from 1963 to 1979. 


0.25 { 
0.20 


0.10 Mean 


9-8 7 e -2-11 23 45 6 7 89 


-0.05 


-0.140 4 
987654327123 456789 
Figure 7. The US turn of the month effect, mean daily 
percent returns on trading days —9 to +9, 1963-81. Source: 
Ariel (1987) 


The rest of the market gains during 1963-81 
occurred in the second week of the month. The 
first half of the month, namely trading day —1 to 
+9 had all the gains and the second half of the 
month had no returns at all. For these 19 years, 
Ariel (1987) found the following portfolio gains: °? 


Equally Value 


weighted weighted 
First half of trading month 2552.40% 565.40% 
Last half of trading month 0.28%  -33.80% 
Nineteen years 2545.90% 339.90% 


Lakonishok and Smidt (1988) reported that 
over the ninety year period 1897-1986, the large 
capitalized Dow Jones industrials rose 0.475% 
during the four day period —1 to +3 each month 
whereas the average gain for a four day period is 
0.061%. There was an increase in prices over 
56% of the time. The average gain per month 
over these 90 years was 0.349%. Hence aside 
from these four days at the turn of the month, the 
DJIA actually fell. 

The effect has continued in recent years even 
in the presence of index futures contracts which 
have been trading since 1982. Hensel, Sick and 
Ziemba (1994) found the results presented in 
Table 16 and Figure 8. Consistent with the previ- 
ous evidence about two thirds of the month’s 
gains occur on trading days —1 to +4, the turn 
of the month, and the rest of the month’s gains 
occur on trading days +5 to +9 so that all or 


> One also sees the strong small firm effect during these years 
as the equally weighted index rose over seven times as much 
as the value weighted large capitalized index. 


Table 16 


The US turn of the month effect: average daily returns and ¢t-values for the hypothesis that returns are greater than zero and the mean return, respectively, in the cash and 
futures markets for small and large capital stocks during the turn of the month, first half of the month, and rest of the month. May 1982—April 1992 


Average daily returns (%) 


t-values (Ho: return = 0) 


t-valucs (Ho: ret diff = 0) 


TOM FH ROM All days TOM FH ROM All days TOM FH ROM All days 
—lto4 —1to9 10to-2 -lto~i —tto4 —-1to9 Wto-2 -Ito-1 —-I}to4 -lto9 Wto-2 —-lto 1 
(a) Cash market data 
Value Line index: Avg daily VL index ret = 0.0388 
January 0.5483 0.2547 0.1087 0.1769 4.40 2.61 1.42 2.89 4.08 2.21 0.91 2.26 
February 0.3839 0.2479 0.1030 0.1784 4.00 3.61 1.56 3.72 3.59 3.04 0.97 2.91 
March 0.3229 0.1712 0.0099 0.0842 3.99 2.94 0.18 2.11 3.51 2.28 —0.53 1.14 
April 0.0054 0.0161 0.0051 0.0104 0.06 0.22 0.09 0.22 — 0.36 ~0.31 — (0.57 —0.81 
May 0.1464 0.1147 — 0.0084 0.0494 1.96 2.16 —0.14 1.20 1.44 1.43 -0.77 0.26 
June 0.2203 0.0709 0.0006 0.0336 2.63 1.18 0.01 0.75 2.16 0.53 — 0.58 ~0.12 
July 0.0278 0.0416  — 0.0927 — 0.0291 0.31 0.68 — 1.59 — 0.69 —0.12 0.05 — 2.26 — 1.60 
August — 0.0212 0.0036 0.0864 0.0493 —0.15 0.04 1.07 0.81 —0.42 ~ 0,38 0.59 0.17 
September 0.0495 —0.0804 — 0.0801 — 0.0802 0.53 — 1.04 —1.40 — 1.68 0.11 — 1.54 —2.07 ~ 2.49 
October 0.1272 0.0422. —0.3121 — 0.1518 1.34 0.55 — 1.61 —1.35 0.93 0.04 -1.81 - 1.70 
November 0.3421 0.2321 —0.0711 0.0768 2.33 2.46 — 1.07 1.32 2.06 2.05 — 1.65 0.65 
December 0.0521 0.0170 0.1410 0.0819 0.39 0.22 1.70 1.44 0.10 ~ 0.28 1.23 0.76 
All months 0.1837 0.0943 = —0.0112 0.0388 5.84 4.30 — 0.44 2.27 4.61 2.10 — 1.95 0.00 
All except Jan 0.1506 0.0797 = — 0.0225 0.0261 4.70 3.59 0.83 1.47 3.49 1.84 — 2.26 ~ 0.72 
S&P S00 index: Aug daily S&P index ret = 0.0501 

January 0.2730 0.0716 0.1371 0.1065 1.75 0.58 1.46 1.40 1.43 0.17 0.93 0.74 
February 0.2116 0.1471 0.0909 0.1202 1.79 1.68 1.01 1.93 1.36 1.11 0.46 1.12 
March 0.2220 0.0987 0.0297 0.0615 2.01 1.28 0.41 1.16 1.56 0.63 — 0.28 0.22 
April 0.0308 0.0432 0.0680 0.0560 0.24 0.42 0.82 0.86 —0.15 — 0.07 0.22 0.09 
May 0.1393 0.0872 0.0377 0.0609 1.35 1.15 1.48 1.1) 0.86 0.49 0.16 0.20 
June 0.2074 0.0579 0.0521 0.0548 1.70 0.67 0.65 0.94 1.29 0.09 0.03 0.08 
July — 0.0456 0.0348 — 0.0332 — 0.0009 —0.37 0.44 — 0.44 —0.02 — 0.78 spa SH ~— 0.94 
August 0.0234 0.0540 0.1129 0.0865 0.15 0.54 1.1) 1.23 —0.17 0.04 0.62 0.51 
September 0.0138 — 0.0961 — 0.0443 — 0.0698 0.10 — 0.97 —0.56 —1.11 — 0.26 — 148 =1,19 — 1.90 
October 0.2029 0.1256 — 0.1763 — 0.0397 1.43 1.24 —0.69 —0.27 1.08 0.74 —0.89 —0.61 
November 0.2776 0.1500 — 0.0718 0.0364 1.89 1.38 —0.78 0.51 1.55 0.92 —1.33 —0.19 
December 0.1323 0.1073 0.1531 0.1313 0.78 1.05 1.66 1.91 0.48 0.56 0.11 1.18 
All months 0.1407 0.0734 0.0290 0.0501 3.59 2.65 0.88 2.30 2.31 0.84 — 0,64 0.00 
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(b) Futures market data 


Value Line futures: 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All months 
All except Jan 

S & P 500 futures: 
January 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
All months 
All except Jan 


0.5367 
0.3537 
0.2795 
0.0419 
0.1362 
0.1601 
0.0204 
— 0.1085 
— 0.0024 
0.1516 
0.2528 
0.1488 
0.1642 
0.1304 


0.2437 
0.2042 
0.1839 
OOI 
0.1235 
0.1585 
= 0.0053 
= 0.0243 
— 0.0163 
0.1708 
0.2680 
0.2097 
0.1273 
0.1167 


0.1926 
0.2115 
0.1467 
0.0186 
0.1228 
0.0709 
0.0715 
— 0.0306 
— 0.0974 
0.0853 
0.2404 
0.0318 
0.0887 
0.0792 


0.0485 
0.1396 
0.0743 
0.0170 
0.0871 
0.0484 
0.0566 
0.0383 
— 0.1185 
0.1284 
0.1499 
0.1094 
0.0649 
0.0664 


0.1327 

0.0859 
— 0.0165 
— 0.0247 
— 0.0043 
— 0.0067 
— 0.1144 

0.0844 
— 0.0996 
— 0.3130 
— 0.1097 

0.1833 
— 0.0186 
— 0.0328 


0.1186 
0.0738 
~ 0.0060 
0.0480 
0.0254 
0.0493 
~ 0.0649 
0.0987 
— 0.0529 
— 0.1972 
— 0.1122 
0.1494 
0.0100 
— 0.0002 


0.1606 
0.1510 
0.0584 
— 0.0039 
0.0551 
0.0298 
— 0.0267 
0.0329 
— 0.0985 
— 0.1336 
0.0602 
0.1115 
0.0321 
0.0202 


0.0859 
0.1079 
0.0308 
0.0331 
0.0542 
0.0489 
— 0.0073 
0.0716 
— 0.0852 
— 0.0505 
0.0150 
0.1303 
0.0360 
0.0314 


3.43 
2.38 
2.25; 
0.26 
1.36 
1.29 
0.13 
— 0,52 
0.02 
0.94 
1.36 
1.02 
3.69 
2.82 


1.38 
1.47 
1.62 
0.07 
1.15 
1.11 
—0.03 
—0.13 
— 0.10 
1.02 
1.49 
1.22 
2.83 
2.51 


1.32 
2.02 
1.84 
0.15 
1.62 
0.80 
0.68 
— 0.24 
— 0.90 
0.74 
1.99 
0.35 
2.82 
2.50 


0.33 
1.38 
0.90 
0.14 
1.07 
0.49 
0.59 
0.33 
— 1.03 
1.07 
1.18 
1.05 
2.03 
2.06 


27 

93 
0.21 
—0.25 
— 0.05 
— 0.07 
— 1.36 
0.75 
— 1.06 
—1.12 
— 1.08 
1.80 
=j] 
—0.85 


113 
0.77 
— 0.07 
0.52 
0.30 
0.52 
—0.79 
0.89 
—0.56 
—0.55 
— 1.07 
1.45 
0.24 
0.00 


1.82 
2.15 
1.05 
— 0.05 
0.99 
0.47 
— 0.40 
0.39 
~— 1.38 
— 0.82 
0.76 
1.63 
1.32 
0.80 


0.98 
1.55 
0.53 
0.44 
0.91 
0.72 
—0.12 
0.89 
—1.15 
0.25 
0.18 
1.78 
1.34 
LAL 


Avg daily VL futures ret = 0.0321 


3.22 1.10 0.96 1.46 
2.17 1.72 0.58 1.70 
1.99 1.44 — 0.63 0.47 
0.06 —O.Lt — 0.58 — 0.45 
1.04 1.20 —0.45 0.41 
1.03 0.44 — 0.42 — 0.04 
— 0.07 0.38 —1.74 — 0.88 
— 0.67 —0.50 0.47 0.01 
— 0.24 -1.19 -1.41 — 1.83 
0.74 0.46 — 1.24 — 1.62 
1.19 1.72 — 1.40 0.35 
0.80 0.00 1.48 1.16 
2.97 1.80 ~ 1.39 0.00 
2.13 1.49 — 1.68 — 0.47 
Avg daily S & P futures ret = 0.0360 
LIS 0.09 0.79 0.57 
1.21 1.03 0.39 1.03 
1.30 0.47 -0.52 —0.09 
—0.16 —0.15 0.13 — 0.04 
0.81 0.63 —0.12 0.31 
0.86 0.12 0.14 0.19 
— 0.26 0.21 — 1.23 — 0.69 
— 0.32 0.02 0.57 0.44 
— 0.33 —1.34 =095 -1.64 
0.80 0.77 -0.65 -0.42 
1.29 0.90 —1.41 =OS 
1.01 0.70 1.10 1.29 
2.03 0.91 — 0.62 0.00 
1.74 0.94 — 0.80 -0.16 


Source: Hensel, Sick and Ziemba (1994). 
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Figure 8. Mean percentage daily returns in the cash and 

futures market for small and large capitalized stocks by trad- 

ing day of the month, May 1982—April 1992. (a) Small capital- 

ization stocks (Value Line composite) (top); (bottom) (b) 

Large capitalized stocks (S&P 500). Source: Hensel, Sick and 
Ziemba (1994) 


Mean of Cash 


more than all of the gains occurs in the first half 
of the month. The second half is at best noise. 
The effect is monthly dependent with the largest 
gains in January and size dependent with the 
small capitalized value line index of about 1650 
stocks having higher means and lower standard 
deviations than the large capitalized S & P500 in- 
dex. There is partial anticipation in the futures 
market as shown in Figure 8. For the small capi- 
talized value line index, the cash effect on day 
—1 is partially anticipated on days —4 to —2. 
Then the effect in the cash market on days +2 
and +3 is partially anticipated on day + 1. Hence 
the cash market effect on days —1 to +4 is as 
Ariel found for the 1963 to 1981 data with small 
gains on days —4 to —2. For the large capitalized 
S & P500 index, the results are similar except that 
there are higher returns in the cash market in the 
anticipation period (—4 to — 2) and lower returns 


in the —1 to +4 period. As Lakonishok and 
Smidt found, there are insignificant gains on day 
+4 except in January. Heavy arbitrage trading in 
the S& P500 is offered as the cause for this shift. 
Once the futures moves on day —4, the cash 
moves as well through the arbitrage of selling 
expensive futures and buying cheaper stocks. 
There is less anticipation in the Value Line, 
especially in the anticipation cash market because 
the Value Line is too difficult to arbitrage with its 
1650 stocks many of which are very small with low 
volumes. 

Hensel, Sick and Ziemba (1994) computed the 
growth on one dollar over the 10 years of their 
sample in the cash and futures markets for small 
and large capitalized stocks during the turn, first 
half, and the rest of the month as well as during 
all days by month. These values show the com- 
pounded effects of monthly as well as intramonth 
seasonality during the TOM, FH and ROM sub- 
periods and the futures anticipation. The results 
show very high returns during TOM for most 
months especially for the Value Line. The second 
half of many months are very negative with the 
worst month being October when $1 in 1982 
became 68¢ for the small stocks and about 80¢ 
for the large stocks in 1992. Meanwhile in Jan- 
uary’s TOM the $1 grew to $1.32 for small stocks 
and $1.15 for large stocks using only five days per 
year. Table 16 shows the details for the cash 
market. The futures results are similar and ap- 
pear in Hensel, Sick and Ziemba (1994). Addi- 
tional results on the S&P cash market from 1928 
to 1993 appear in Hensel, Sick and Ziemba (1993). 

The reasons for the turn of the month effect 
are several but they are largely cash flow and 
institutionally based. The US economy works on 
a system where much money is paid on the —1 
day such as salaries and bills and debt payments. 
In additions to some of this money being invested 
in the stock market, there are institutional corpo- 
rate and pension fund purchases at that time. 
These cash flows vary by month and lead to 
higher average returns in January which has the 
highest cash inflow. Ogden (1990) presents some 
empirical support of this hypothesis and related 
monetary actions for US markets. Another factor 
in this effect seems to be behavioral. One mani- 
festation is that bad news such as that relating to 
earnings announcements is delayed and an- 
nounced late in the month while good news is 
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released promptly at the beginning of the month; 
see Penman (1987). 

Hensel, Sick and Ziemba (1994) also found 
that the return distributions were very different 
during TOM, FH and ROM. In the cash market, 
the normality of the first half price changes can- 
not be rejected, but the returns were non-normal 
during the turn-of-the-month and the rest of the 
month. The high returns during TOM and low 
returns during ROM in the cash market appar- 
ently lead to the rejection of normality because of 
their skewness. The smoothing effect of the fu- 
tures anticipation yields distributions that are not 
skewed enough to reject the normality hypothe- 
sis. As Ariel (1987) found for the 1963 to 1981 
data, the higher returns during TOM and ROM 
are caused by a slight shifting of the return distri- 
bution and not by a few outliers. 

The turn of the month is similar in Japan 
except that the dates change with the turn being 
days —5 to +2, with +3 to +7 being the rest of 
the first half of the month. Ziemba (1989, 1991) 
investigated this; see also the more detailed sum- 
mary in Ziemba and Schwartz (1991). The rea- 
sons for the effect in Japan seem to be: 

e Most salaries are paid during the period of 
the 20-25th day of the month, with the 25th 
being especially popular. 

e There is portfolio window dressing on day 
-1. 

e Security firms can invest for their own ac- 
counts on amounts based on their capitalization. 
Since their capitalization usually rises each month 
and is computed at the end of the month, there is 
buying on day —3 to account for this. Buying is 
done as soon as possible. 

e Large brokerage firms have a sales push that 
lasts 7 to 10 days and this starts on day —3. 

e Employment stock holding plans and mutua! 
funds receive money in this period to invest, 
starting around day —3. 

e Individual investors buy mutual funds with 
their pay, which they receive on calendar days 15 
to 25 of the month; the funds then invest in 
stocks with a lag, so most of the buying occurs on 
days —5 to +2. 

e For low liquidity stocks, buying occurs over 
several days by dealing in accounts to minimize 
price pressure effects. 

Using data on the NSA from 1949 to 1988 
Ziemba (1991) found that all of the days —5 to 


+2 had significantly positive returns as shown in 
Table 17 and Figure 9. As in the US all the gains 
occurred in the first half of the month and the 
second half had zero or negative returns. 
Ziemba (1989) investigated the futures market 
trading outside Japan on the Simex in Singapore 
on the turn of the month and other anomalous 
effects in Japanese security markets during the 
period September 1986 to September 1988 before 
there was futures trading on the NSA or Topix in 
Japan. He found that the spot effect was consis- 
tent with past data so the futures market did not 
alter the effect. However, the futures market in 
Singapore totally anticipated the effect on days 
~8 to —5 with a total average rise on 2.8%. 
Then when the effect occurred on days —5 to +2 


Table 17 

Growth of $1 investment in the cash market for small and 
large capitalized stocks in various periods, May 1982 to April 
1992 


Growth of $1 investment 


TOM FH ROM All days 
-lto4 -1lto9 10to-2 -lto-1 


Value Line cash market: 


January 1.31539 1.29004 1.13193 1.46024 
February 1.21160 1.28128 1.09938 1.40861 
March 1.17520 1.18676 1.01164 1.20057 
April 1.00272 1.01623 1.00556 1.02187 
May 1.07595 1.12155 0.99050 1.11089 
June 1.11645 1.07345 1.00064 1.07413 
July 1.01399 1.04248 0.90220 0.94053 
August 0.98944 1.00359 1.11211 1.11610 
September 1.02506 0.92279 0.92078 0.84969 
October 1.06565 1.04312 0.68550 0.71505 
November 1.18652 1.26119 0.92802 1.17041 
December 1.02641 1.01711 1.16779 1.18778 
All months 3.01108 3.10035 0.86112 2.66979 
All except Jan 2.28912 2.40330 0.76076 1.82832 
S & P 500 cash market: 
January 1.14627 1.07421 1.16913 1.25589 
February 1.11162 1.15848 1.08725 1.25956 
March 1.11741 1.10378 1.03535 1.14280 
April 1.01551 1.04413 1.07551 1.12298 
May 1.07212 1.09108 1.04347 1.13851 
June 1.10926 1.05961 1.06061 1.12383 
July 0.97746 1.03546 0.96383 0.99801 
August 1.01477 1.05548 1.14891 1.21266 
September 1.00693 0.90834 0.95544 0.86786 
October 1.10680 1.13381 0.80792 0.91603 
November 1.14888 1.16187 0.92741 1.07753 
December 1.06837 1.11331 1.18344 1.31753 
All months 2.32626 2.41413 1.46965 3.54792 
All except Jan 2.02942 2.24736 1.25705 2.82503 


Source: Hensel, Sick and Ziemba (1994). 
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eel Table 18 
0.20 Mean returns in percent on the NSA by trading day of the 
0.15 | month, 1949-1988 
0.10 Trading Sample Mean t- 
0:05 day size return, % statistic 
ane A -5 471 0.0899 2.75 
-4 471 0.1041 2.65 
BPE -3 471 0.1733 3.82 
KeS: ` 11213141516171819202122 -2 sm 0.1334 3:18 
~5-4-3-2-1 234567 8 910111213 =ï 471 0.2255 5.36 
Figure 9. Rate of return by trading day-of-the-month, NSA, 1 471 0.0980 2.43 
1949-1988, Source: Ziemba (1991) 2 471 0.1006 2.47 
3 471 0.0307 0.72 
4 471 0.0592 1.15 
and the spot market gained 1.7%, the futures 5 pa Sone Sat 
market was flat; see Figure 10. 5 4 aie DO 
Cadsby and Ratner (1991) investigated turn of 8 471 — 0.0585 -134 
the month and holiday effects in the US, Canada, 9 471 0.1065 2.46 
the UK, Australia, Switzerland, West Germany, 10 471 0.0620 1.58 
Japan, Hong Kong, Italy and France. Their use of a Fe Hae : = 
—1 to +4 as the turn of the month in all these 13 471 00115 024 
countries makes their results hard to interpret 14 471 ~ 0.0042 ~0.10 
since there may be cash flow and institutional 15 471 — 0.0306 —0.54 
constraint date alterations as in Japan. Still they 16 471 0.0716 1.35 
find a significant turn of the month in most 17 471 — 0.0498 ~ 1.20 
i es 18 429 — 0.0207 -0.49 
countries as shown in Figure 11. 19 350 0.0162 036 
A good example of a significant turn-of-the- 20 229 ~ 0.0286 ~0.48 
month effect with different dates is that in Italy. 21 118 — 0.0476 — 0.46 
The month there essentially starts on the 15th 22 39 0.0562 0.45 


(approximately) and all trades from then up to 
the following 14th are settled on the last trading 
date of that calendar month. This system leads to 
high returns from the 15th to the end of that 
calendar month and low or negative returns from 
the +1 day to the 14th; see Barone (1990). Not 
surprisingly, Italy has not had high returns on 
days — 1 to +4 and does not appear in Figure 11. 

I investigated more recent data from the UK 
in addition to futures market data. Their results 


Spot Data 1949-88 
trading days 
1st Half 22-7 .01142 


2nd Half 8-21 .00048 


Source: Ziemba (1991), 


from 1984-92 show a significant turn-of-the- 
month effect. Anticipation in the futures markets 
on trading day —3 provides a TOM from —3 to 
+3 as shown in Table 19 and Figure 12. As with 
other countries, the effect is monthly dependent 
and the strongest months are similar to those in 


Futures Market Effects: 1986-88 


anticipation 


Al Ist Half 2nd Hair 


NSA E Speten TSE W Futures on 
SIMEH 


Figure 10. Daily rates of return in the first and second halves of the month in Japan, cash data, 1949-1988, futures market effects 
1986-88. Source: Ziemba (1989) 
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Figure 11. International turn-of-the-month-effects. Daily rates 
of return in % for Canada (1/75-12/87), Hong Kong (1/80- 
89), UK (8/83-6/88), Australia (1/80-8/89), Switzerland 
(1 /80-8/89), West Germany (1/80-8/89), and France 
(1/80-8 /89). Source: Cadsby and Ratner (1991) 


the US with the early months especially January 
and the late months the strongest. 

The turn-of-the-month effect seems to be a 
worldwide phenomenon. The paper by Mar- 
tikainen, Perttunen and Ziemba (1994), although 
it only considers the brief period from January 
1988 to January 1990, shows this with various 
indices for a variety of regions of the world. They 
also found that returns on individual days during 


Table 19 
The turn-of-the-month effect in the UK, 1984-1992 


WFutures O Cash 
Figure 12. The turn-of-the-month effect in the UK, mean 
percent returns on trading days — 5 to + 14, 1984-1992. 


TOM, especially —1, were high. The latter aver- 
aged 0.41% per day over 24 countries with a £ 
value of 3.89 and was the only trading day with a 
mean return different from zero. 


The January barometer 


Around the world, January returns are gener- 
ally very high. In many countries and particularly 
in the US markets, small stocks greatly outper- 
form large stocks in January. For large capital- 


Trading Futures market Cash market 
day Data period: May 4th, 1984 Data period: May 4th, 1984 
—July 17th, 1992 ~May 10th, 1992 
Sample Average t-statistic Sample Average t-statistic 
size return % size return % 
-5 98 -0.025 —0.252 96 — 0.063 ~ 0.603 
—4 98 — 0.035 ~ 0.363 96 —0.017 — 0.194 
= 3 98 0.127 1.201 96 0.104 1.228 
=2 98 0.161 1.505 96 0.176 2.076 
Sd 98 0.105 0.795 96 0.158 1.767 
l 98 0.633 4.830 96 0.163 1.966 
2 98 0.094 0.930 96 0.171 2.062 
3 98 0.118 1.266 96 0.125 1.625 
4 98 — 0.007 — 0.066 96 0.050 0.598 
5 98 0.065 0.669 96 0.092 1.101 
6 98 0.037 0.359 96 0.042 0.469 
7 98 — 0.105 — 1.038 96 — 0.093 —1.112 
8 98 0.004 0.033 96 0.058 0.547 
9 98 0.004 0.043 96 0.080 0.932 
10 98 0.152 1.662 96 0.144 1.929 
11 98 0.069 0.640 96 0.089 1.114 
12 98 0.004 0.044 96 0.084 1.080 
13 98 = 0.239 — 1.185 96 — 0.146 — 1.126 
14 98 — 0.253 — 1.806 96 — 0.198 — 1.354 
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-1.50 
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec 


Figure 13. S&P500 percentage returns by month, from Febru- 
ary 1928 to June 1993. Source: Ibbotson and Associates; data 
in Hensel and Ziemba (1993a) 


ized stocks, January returns have been higher 
than average but not higher than many other 
months. For example, for the sixty-five years from 
January 1928 to June 1993, fully six other months 
have similar returns as shown in Figure 13. 

However, January seems to be a valuable sig- 
nal for the whole year. 

If large stocks have positive returns in January, 
then it is likely that the market as a whole will 
rise in the rest of that year (ROY). Yale Hirsch 
(1986) has called this the January barometer. The 
supposition is that: 
if the market rises in January, then it will rise in the 
rest of the year; but if it falls in January, then there 
will be a decline or a flat market in the rest of that 
year. 


31 Years 
34 Years ROY UP 91% 
January 
UP 3 Years 
ROY 9% 
DOWN 
54 Years 
Total 
10 Years 
20 Years ROY UP 50% 
January 
DOWN 
10 Years 
ROY 50% 


DOWN 


Table 20 from Hensel and Ziemba (1993a) has 
the results for the 54 years 1940-93. 

When the return in January was positive, the 
rest of the year was up 91% of the time. This 
compares with only 76% of all years that the rest 
of the year was up. 

When the return in January was negative, the 
rest of the year was down 50% of the time. Thus 
even in years when January is down, the rest of 
the year is equally likely to be up or down. 

January 1994 had a gain of 3.2% hence a gain 
in the rest of 1994 is predicted. 


Time of day effects 


Harris (1986a) has documented a time of day 
effect for NYSE securities. Figure 14 shows the 
typical behavior using all NYSE stocks from 1981 
to 1983. Consistent with the day of the week 
finding there are share price rises on Tuesday to 
Friday and declines on Mondays. Price action 
occurs mostly in the first 45 minutes and then 
there is little movement until a sharp rise near 
the close of trading. Harris (1986b) discusses this 
latter move and details the very high percentage 
of final trades that are at price increases. The 
percentage of stocks advancing on their last trade 
of the day is usually above 80%. As pointed out 
by Kevin Rowe, this may be largely attributed to 
specialists book balancing to end the day. Admati 
and Pffeidler (1988, 1989) present a formal theory 
that attempts to explain the intra and inter day 
changes based on the trading of participants with 
differential information. 

Japanese stocks seem to have all their gains 
during the non trading period at night. Table 21 
using data on the Topix from 1974-87 with yearly 
rebalancing shows this in a rising market and 
Table 22 for a declining market. The effect seems 
to be broker related with solicitations of buy 
orders obtained after market closing on the previ- 
ous day that are executed on the open in the cash 
and futures market. Table 21 also illustrates the 
size effect in which the smallest capitalized quin- 
tile had returns more than twice as high as the 
largest capitalized quintile. Table 22 shows that 
even though the NSA fell over 50% from January 
1990 to March 1992 there were net gains at night. 
Bell and Ziemba (1993) show that this has contin- 
ued in the remainder of 1992. Keim and Smir- 
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Table 20 lock (1987) found consistent but not as sharp 
The January barometer 1940-1993 results in the US for small stocks with the Value 
Year S&P500 S&P500 Direction Did it Line small capitalized index having its major gains 

(%) (%) of signal work? at night. 
Jan Rest of Tan Rest of 
year 
year , 

1993 07 63 an ip yes Can professional fund managers beat the market 
1992 -1.9 4.3 down up no averages: 
1991 4.3 22.3 up up yes N A ` E 3 
1990 -6.9 37 down up no The evidence presented in this review is poten- 
1989 7.0 20.4 up up yes tially useful in the design and implementation of 
1988 4.2 11.4 up up yes superior portfolio management strategies. In par- 
1987, “126 “7S. Bp down ao ticular, the aim is to have higher average returns 
1986 0.4 16.5 up up yes ith high isk th h k 
1985 74 20.5 up ùp yès with no higher ris than the market averages. 
1984 -0.7 67 down up no Shukla and Trzcinka (1992) survey methods to 
1983 3.4 16.9 up up yes evaluate managed portfolio performance. Various 
1982 =1.6 21.0 down up no authors such as Cowles (1933), Sharpe (1966), 
Ae Ti ae Son conn s Jensen (1968), Brown, Friend, Herman and Vick- 
Do AA Gee sae A shes ers (1962), Friend, Blume and Crockett (1970), 
1978 —61 12.5 down up no Henriksson (1984), Brinson, Hood and Beebower 
1977 —50 -2.4 down down yes (1986), Grinblatt and Titman (1989), Elton et al. 
1976 113 10.1 up up yes (1991), and Lakonishok, Shleifer and Vishny 
1913 ALS 198 up up yes (1992) have found that professional and pension 
1974 —0.9 — 29.9 down down yes 2 
193 -16 -142 dowi down Yes fund managers do not usually outperform their 
1972 1.9 15.5 up up yes benchmark portfolios once transactions costs and 
1971 4.1 9.3 up up yes fees are considered. Table 23 shows the usual 
1970 -7.7 11.7 down up no behavior. 
1969:  =0.7 11:7.. down <p no This and as suggested by the capital asset 
1969 -0.7 -8.2 down down yes as i lusi hold ks i 
1968 —43 148 down up hia pricing model s conclusion to old stocks in 
1967 7.7 13.8 up up yes weights proportional to their capitalization pro- 
1966 0.6 -11.2 up down no vides a strong case for index funds. Indeed, there 
1965 3.4 8.3 up up yes is much support for this view in the pension fund 
1364 2.8 125 up up yes management industry and hundreds of billions of 
1963 4.9 15.6 up up yes doll d. Wells F in San F 
1962 -3.7 -54 down dows yes lollars are so managed. Wells Fargo in San Fran- 
1961 6.3 17.6 up up yes cisco alone so manages a sum in the $100 billion 
1960 -73 7.7 down up no range. 
1959 0.5 10.8 up up yes The reasons why index funds typically outper- 
1338 TA SU Mp “E ae form active management are: 
1957 —4.1 =73 down down yes Th k fully i d ll 
1956 -3.5 99 down up iò f e e mar et averages stay fully investe at a 
1955 2.0 25.5 up up yes times, never missing market moves nor paying 
1954 5.2 37.1 up up yes commissions for stock changes and market tim- 
1953 —0.5 —0.5 down down yes ing. 
1952 1.8 15.1 up up yes 
1951 6.2 15.3 up up yes Table 21 
1950 1.9 25.6 up up yes Š r y 
1949 04 16.8 up up yes Topix mean returns for size related portfolios, 1974-1987 
1948 —3.9 9.2 down up no Firm size Average Mean return 
1947 2.5 3.0 up up yes market Value 
1946 69 -15.3 up down no (¥ million) a H ao S apep a 
1945 1.6 29.5 up up yes p 
1944 1.7 16.3 up up yes Smallest m3 0.1492 0.1784 — 0.0295 
1943 71 159 up up yes 2 1,716 0.1058 0.1545 — 0.0489 
1942 1.6 16.9 up up yes 3 3,275 0.0859 0.1354 — 0.0495 
1941 —4.7 —7.6 down down yes 4 6,554 0.0764 0.0976 — 0.0213 
1940 —3.4 -6.9 down down yes Largest 32,689 0.0653 0.0483 0.0168 


Source: Hensel and Ziemba (1993a). 


Source: Kato (1990). 


Chapter 18: Worldwide Security Market Regularities 425 


222 W.T. Ziemba / World wide security market regularities 


Cumulative return during day (percent) { 
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Figure 14. The time of day effect on the MYSE. Source: Harris (1986a} 


e When funds get behind the index, they often dow dress at reporting times, adding to turnover 
make hasty moves to try to catch up and, more and commissions. 
often than not, this puts them further behind. e Since the managers collectively more or less 
e Portfolio managers have a tendency to win- are the market (with individual investors forming 
30,000% 
20,000% 
10,000% Group 1: 24,366% 
Group 2: 1,823% 
Group 3: 136% 
5,000% Group 4; -59% 
1,000% 
500% 
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Figure 15. Value Line timeliness rankings, revised weekly, April 16, 1965 to December 27, 1991. Source: Value Line Investment 
Survey 
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Table 22 
Mean returns in percent during the day and night by month in 
Japan, NSA, January 1990-March 1992 


Month Mean return 


Close to Open Open to Close 


All days 0.001150 — 0.002390 
Jan 0.003870 — 0.005780 
Feb — 0.003850 0.003768 
Mar 0.001790 — 0.005670 
Apr — 0.000130 — 0.000540 
May 0.003120 — 0.001650 
Jun 0.007450 — 0.010220 
Jul ~ 0.000960 0.000638 
Aug 0.006120 — 9.011000 
Sep — 0.008440 0.005170 
Oct — 0.002300 0.006320 
Nov 0.010820 —0.015820 
Dec — 0.003640 0.005923 


Source: Bell and Ziemba (1993). 


less and less of the market each year) the indices, 
on average, beat half of the fund managers. Then 
with commissions, fees, and these other reasons, 
only the 25% to 40% typically beat the market 
averages (which does quite well with a lot less 
work). 

e The fund managers take fees; the averages 
work for free. 

e The fund managers’ goals may get in the way 
of the fund’s best interests. This is the so-called 
agency problem. 


Table 23 


e Portfolio managers tend to follow each 
other’s moves. They tend to move the market 
which gains the full amount, and they can easily 
be a little behind. 

Hence although the evidence in this review 
suggests methods and strategies for beating mar- 
ket averages * it is clear that only a small number 
of managers are able to successfully implement 
such strategies over a long period of time. ° 


+ My own experience with a small private mutual fund based 
on anomalies has been encouraging. In 76 months from 
October 1987 to January 1994, the fund averaged 27.3% 
annualized (net of fees) with a standard deviation of 13.2%. 
Experience, size and diversification are important in such 
funds and the 1991-94 results were superior providing a 
similar mean return but with the much lower standard 
deviation of 8.1%. 

5 Lakonishok, Shleifer and Vishny (1992) used a data base on 
all 769 all-equity pension funds followed by SEI, a financial 
services company that specialized in evaluating the perfor- 
mance of pension plans and in recommending money man- 
agers. The results for 1983-89 for these managers, who 
collectively managed $124 billion as of the end of 1989 was 
underperformance of the S & P500 by 1.3% per year before 
management fees which averaged 0.5%. Hence, the average 
active manager subtracted rather than added value. How- 
ever, Hendricks, Patel and Zeckhauser (1991) show that 
managers that have had outperforming track records tend 
to continue to outperform. Ippolito (1989, 1992) and Goetz- 
mann and Ibbotson (1991) also confirm the existence of 
superior managers. Brown et al. (1992) show the effect of 
survivorship bias in such studies and how it over estimates 
the true return. 


Sector Median Returns of Hong Kong Based Funds Compared with Market Average, January 1 to November 30, 1988 


Class of fund Outperforming Reference Number of funds 

median market average % Change Market 

return, % average 
Japanese equity funds 19.1 TSE Index 34.9 4 of 30 
Hong Kong equity funds 24.4 Hang Seng index 20.8 9 of 15 
Australian equity funds 23.1 All-ordinaries index 43.3 lof lt 
Singapore /Malaysian 

equity funds 25.8 FT Actuaries Singapore 25.9 3 0f 7 
Asean equity funds 41.4 FT Actuaries HK and Singapore 27.7 Il of 13 
Far East equity funds 22.1 FT Actuaries Pacific 35.1 3 of 20 
US equity funds 7.6 S&P Composite 14.0 5 of 25 
British equity funds 6.8 FT All Shares Index 10.9 4 of 19 
European equity funds 13.0 FT Actuaries Europe 13.9 7 of 15 
International managed funds 9.4 FT Actuaries World 23.1 0 of 6 
British gilt funds 1.0 FT All Stock Bond index 5.0 1 of 6 
(28.7%) 48 of 167 


Source: Ziemba and Schwartz (1991). 
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Table 24 
Value line timeliness rankings (%), revised weekly, April 16, 
1965 to December 27, 1991 


Group 

1 2 3 4 5 1-5 
65 26.6 16.5 6.7 —0.4 -3.2 32.0 
"66 —5.5 —6.2 —13.9 -15.7 -18.2 12.7 


67 53.4 36.1 27.1 23.8 21.5 31.9 
68 37.1 26.9 24.0 20.9 11.8 25.3 
%9 -10.4 -17.5 -23.8 -33.3 —449 345 


70 73 —3.2 -80 -163 —23.3 30.6 
71 30.8 13,7 9.3 8.4 -5.5 36.1 
Ta 12.6 7 35 =7.1 —134. 269 


Ro -13.1 <39 -33.8 —318 2438 241 
"74 -11.1 -295 -34.1 -40.6 ~-55.7 446 
75 75.6 47.4 40.7 39.3 40.9 34,7 
"716 54.0 31.2 29.0 28.8 26.7 _ 273 


"77 26.6 13.4 13 —6.9 -176 41.2 
"78 32.6 18.3 3.0 —3.8 —3.2 358 
"79 54,7 38.0 20.7 12.8 10.4 41.8 
80 52.6 35.7 15.4 7.4 2.9 49.7 
’81 13.6 1.8 —3.3 -8.7 —214 35.0 


82 50.6 31.0 17.9 5.1 -10.9 61.5 


91 55.4 34.1 18.9 16.7 25:5- 299 
65-91 24.366 1823 136 —59 —97 


Source: Value Line Investment Survey. 


The Value Line enigma 


There is considerable evidence that the rank- 
ings made by the Value Line Investment Survey 
actually do have value to investors. Each week 
stocks in their universe (about 1650) are ranked 
into five groups based on their projected timeli- 
ness for the subsequent year. Table 24 and Figure 
15 show the theoretical results not counting 
transactions costs, dividends or taxes, of the 
strategies of holding all #1, #2, #3, #4 and #5 
ranked stocks and rebalancing whenever there is 
a rank change for the period April 16, 1965 to 
December 27, 1991. On the surface, the results 
are impressive. Long positions in #1s averaged 
24.2% over these 27 years. Moreover, a close to 
beta equal zero market neutral strategy that went 
long #1s and short #5s averaged 33.7% returns 
with an annualized standard deviation of 12.2% 


and had only one losing year. A strategy of going 
long #1s and holding for one year and then 
rebalancing had average returns of 16.6% over 
the 27 years. Several academics such as Black 
(1973), Copeland and Mayers (1982), Stickel 
(1985), Huberman and Kandel (1987, 1990) and 
Affleck-Graves and Mendenhall (1992) have stud- 
ied the reported results and in general confirm 
the superior behavior of the Value Line ranking 
model. 

Several questions arise. Why is Value Line 
able to beat the market with publicly available 
information and rankings? How reliable are the 
rankings? How should one implement such a 
strategy with actual funds? 

On the first question, it is known that Value 
Line’s model uses several elements that include 
price momentum, size and earnings variables. 
Analysis of rankings by Stickel (1985) concluded 
that the rankings were better than chance so that 
Value Line’s model has information that is not 
fully reflected in current stock prices. Copeland 
and Mayers (1982) show that excess returns peak 
at about thirteen weeks from the time a rank 
change is made. Hence, there is a lag effect. Ball 
and Brown (1968), Bernard and Thomas (1989, 
1990) and Abarbanell and Bernard (1992) have 
studied such stale information effects regarding 
earnings. There is considerable evidence that the 
market does not immediately factor new earnings 
information instantaneously. In particular even 
after three days after quarterly earnings an- 
nouncements stock prices have not adjusted to 
the new information. Jennings and Starks (1986) 
show that the speed of adjustment to quarterly 
earnings information is faster if options are traded 
in the stock. Affleck-Graves and Mendehall (1992) 
show that a substantial portion of Value Line’s 
superior performance is from rank changes in- 
volving earning surprises of small capitalized 
firms. A good strategy for fund managers is thus 
to invest in Value Line #2 stocks that have just 
reported earnings surprises, many of these stock 
will be elevated to #1 stocks shortly. 

There is superior long run performance of the 
Value Line #1s in comparisons with the #5s. 
However, occasionally the rankings do not pro- 
duce higher returns. This is usually in January 
and the first quarter and is likely associated with 
the DeBondt-Thaler winner-loser effect and the 
January small firm effect. Value Line #5s are 
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likely to be losers such as negative earnings stocks 
and they may have high returns in the next Jan- 
uary. 

Despite the superior performance, there have 
been very few mutual funds that have had suc- 
cessful implementations of the long #1 or long 
#1-short #5 strategy with real money. Obtaining 
precise transactions costs is difficult as is the 
ability to put on the positions before the market 
moves as there are over 100,000 copies of the 
Value Line Investment Survey as well as com- 
puter disks and on line service available to in- 
vestors each week. I have been implementing a 
version of the Value Line neutral investment 
strategy since April 1991, using put and call op- 
tion sales for Global Securities, a Vancouver in- 
vestment firm. The preliminary results are en- 
couraging and the investors have had returns that 
have beaten the market averages with lower risk. 
In the 33 months until the end of December 
1993, the annualized net rate of return was 18.9% 
which greatly exceeds the S&P500 with divi- 
dends. During the first half of 1992 the Value 
Line system was in a period where the rankings 
failed because the market favored value stocks 
rather than the growth stocks that the model 
favors. The option sale strategy was successful in 
this period. However, implementations that were 
long and short probably produced losses. 


Final remarks 


Behavioral factors along with cash flows and 
institutional constraints and policies are impor- 
tant reasons for anomalous behavior. Investor 
sentiment, however it may be defined (such as by 
the level of new IPOs, see Ritter, 1991), greatly 
affects stock prices. For example, Bailey and Lim 
(1992) found that the discounts and premiums of 
closed end country funds traded on the NYSE 
and ASE depend upon the current returns in the 
US rather than the returns in the foreign country. 
Hence the returns of say the Australian, Brazil, 
France or Italy funds tend to have higher short 
term correlations with the S& P500 than the home 
market indices. 

Lee, Shliefer and Thaler (1991) using data 
from 1965-85 showed that the discounts on pre- 
miums of general closed end mutual funds seem 
to be dependent upon general market sentiment 


which includes seasonality effects rather than the 
specific holdings of the fund. For example, dis- 
counts shrink in January when there are high 
small stock returns. Yet Brauer and Chang (1989) 
found that the funds’ underlying assets had no 
January small firm effect. The results seem to be 
based on the existence of two types of investors: 
rational traders, who invest based on fundamen- 
tals, and noise traders who base their investments 
at least in part on the basis of irrational factors 
that can be overly optimistic or pessimistic about 
the future; see DeLong, Shleifer, Summer and 
Waldmann (1990) and DeLong and Shleifer 
(1992). 

Bell and Ziemba (1993) and Ziemba (1994a) 
found that the anomalies fail to behave as ex- 
pected during periods of extreme market volatil- 
ity. Ziemba studied the US markets during the 
buildup and subsequent 1987 stock market crash 
and Bell and Ziemba study the Japanese market 
during the 1990-92 decline of over 60%. In both 
cases market sentiment factors were dominant 
and cash flow considerations secondary. In the 
case of Japan the cash flows were substantial but 
they did not become invested in the stock market 
because investors expectation of future returns 
was negative. 

This survey covers several anomaly topics for 
US and several other equity markets. Effects in 
other markets such as interest rates (see Froot, 
1993), bond and commodity prices are fruitful 
areas of study. So are the effects of various events 
such as stock splits, é and S & P500 additions and 
deletions. Some of these topics are discussed in 
Ziemba (1994a). 
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1. Introduction 


The Fed model postulates that the earnings yield E/P of a stock index tends to move 
in the same direction as the treasury bond yield Y thus giving a way to predict price move- 
ments in the equity market: if E/P is much less than Y one might expect a decrease in 
the equity prices P. This effect was studied already in Ziemba and Schwartz (1991) and 
it became widely known with the Humphrey—Hawkins Report Board of Governors of the 
Federal Reserve System (1997). The idea is that bonds and stocks compete for investment 
funds and those funds tend to move towards the more attractive investment. Ziemba and 
Schwartz (1991) used the difference Y — E/P as a stock market danger indicator. They 
showed that this was accurate in predicting major crashes in the stock markets of US and 
Japan during 1948-1989, Berge and Ziemba (2003) made a further study of the predictive 
ability of this indicator for predicting stock price rises and declines with various data esti- 
mation techniques for the US, Japan, Germany, Canada and the UK. For 1970-2003 Berge 
and Ziemba (2003) found that the measure provided signals to enter and exit the stock mar- 
kets superior to buy and hold strategy and provided signals that predicted large declines. 
Practitioners such as Yardeni (2003) have also used this measure to try signal large stock 
market moves. Ziemba (2003) details many successful applications of the idea in various 
equity markets. Campbell and Vuolteenaho (2004) show that during 1927-2002 the bond 
yield and proxies of the risk premium are strongly statistically significant in explaining 
variations in the log stock yield (either the earnings-price ratio or the dividends-price ratio). 
Despite the empirical success in describing the behavior of stock prices and its popularity 
among practitioners the Fed model has also been criticized for being theoretically invalid; 
see, e.g., Asness (2003) and Campbell and Vuolteenaho (2004). The main criticism is that 
the Fed model compares a nominal quantity Y to the real quantity E/P. 

This paper studies the Fed model from statistical point of view with a focus on the 
ability of the Fed model to forecast changes in equity prices, earnings and bond yields. 
This is done through studying the logarithmic indicator defined by 

iin InY, ~ In E, + In P, 1 

=n) = in n E; + 1n Py. (1) 
Taking logarithms of positive quantities is common practice in econometrics and it has two 
important advantages. First, proportional variations in the values of strictly positive vari- 
ables are often more meaningful than absolute ones. Second, logarithmic variables often fit 
statistical models better than original ones. Figure 1 displays the logarithmic indicator to- 
gether with logarithmic values of the general equity market index for the US. During 2001 
stock market prices fell and the indicator moved to a lower level indicating less danger in 
the market. However, in late 2001, the fall in stock market prices was exceeded by the fall 
in earnings. This lead to the indicator moving even higher which anticipated the 22% fall in 
the stock prices in 2002. In late 2002 the indicator moved to a low level which is consistent 
with the rise in stock prices in 2003. 

The main idea of the paper is to develop a linear time series model around the log 
indicator and to test whether the indicator improves the predictive power of the model. 
Being linear in all the logarithmic variables suggests that 7, might serve as an equilibrium 
correction term in a Vector Equilibrium Correction (VEC) model for the logarithmic vari- 
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Fig. 1. Datastream log US market index and the log indicator (right scale, dotted line) during 1980/1-2003/4. 


ables; see Engle and Granger (1987) and Johansen (1995). The resulting time series model 
provides a quantitative version of the FED-model and it gives predictions not only of the 
equity price index but all the variables in the model. 

Out-of-sample forecast tests in the US, the UK, and Germany indicate that VEC-models 
with the logarithmic indicator outperform Vector autoregressive (VAR) models, without the 
indicator. 

The paper is organized as follows. Section 2 describes the data used in the tests. In 
Section 3, we study the stationary properties of the time series, and in particular of the log 
indicator. In Section 4, we develop and analyze VEC-models for markets in the US, the 
UK and Germany. Section 5 compares the predictive ability of the VEC-models against 
VAR-models in out-of-sample forecast tests. Section 6 concludes. 


2. Data 


The stock market price and earnings data as well as the bond yields for the three markets 
are from Datastream. Datastream total market price indices for the United States, United 
Kingdom and Germany represent the behavior of the stock prices in all three markets. The 
earnings (past) corresponding to the indices are calculated from the reported price earnings 
(P/E) ratios. The bond yields represent the yields on ten-year government benchmark 
bonds and are calculated by Datastream. Our dataset covers quarterly observations from 
January 1980 to December 2003. 


3. Tests for cointegration 


This section studies the stationarity properties of the time series. Figures 2—4 display 
the historical values of In ¥/, In P$, In EZ! and Z} from 1980/1 to 2003/4, where i = US, UK 
and GER, respectively. Even though the log bond yield, stock price index and earnings 
appear non-stationary, their linear combinations (J!) are stationary in all three markets; see 
Figs. 2—4. 
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Fig. 2. Logarithmic time series for the United States, 1980/1-2003/4. 


1. 3 
80 82 84 86 88 90 92 94 96 98 00 02 80 82 84 86 88 90 92 94 96 98 00 02 


(a) n YP. (b) In PPX (solid line) and In BUX, 


4. 
80 82 84 86 88 90 92 94 96 98 00 02 
(c) YK. 


Fig. 3. Logarithmic time series for the United Kingdom, 1980/1-2003/4. 
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Fig. 4. Logarithmic time series for Germany, 1980/1-2003/4. 
Table | 
Unit root test statistics of the time series between 1980/1 and 2003/4 
US UK GER 
Time series nl z(t) t-statistic nl z(t) t-statistic nl z(t) t-statistic 
InY 0 (i) —1.30 0 (1) —0.97 0 (1) —1.28 
In P 0 (1,t) ~1.75 0 (1, ¢) —1.49 0 (le) ~1,94 
InE 1 (i,t) —2.96 2 (1,4) —2.37 0 (i,t) —2.91 
AlnY 0 (0) —10.02" 0 (0) —9,37" 0 (0) -8.23" 
AlnP 0 (1) —9,36" 0 (1) -9,95" 0 (1) —9.40" 
AlnE 0 a) —4.93* 1 d) 5.47" 0 (1) ~8.60" 


Notes. The table reports results of the augmented Dickey—Fuller tests for all the time series. The number of lags 
(nl) in the tests have been selected using the Schwarz information criterion with a maximum of twelve lags. 
The value of z(t) indicates the deterministic terms included in the unit root regressions. When z(t) = 0 the 
deterministic terms are omitted, with z(t) = 1 a constant is included and with z(t) = (1, t) a constant and a trend 


are included. 
“ Indicate the rejection of the unit root null at the 1% significance level. 


We study, using data from 1980/1 to 2003/4, the stationarity properties of In Y$, In PÍ 
and In EÍ by performing augmented Dickey—Fuller (Dickey and Fuller, 1981) tests for all 
the time series. The results of the unit root tests in Table 1 are very similar across the 
markets. The log bond yield, log stock price index and log earnings are non-stationary and 
their first differences are stationary. 
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Fig. 5. Unit root test results for the logarithmic indicators. Notes. Figures display the p-values of augmented 
Dickey—Fuller tests for the logarithmic indicator on subsamples ranging from 1980/1-1990/1 to 1980/1-2003/4 
for the three markets. Low p-values, e.g., below 5%, indicate the rejection of the unit root null. 


Next we test the stationarity of the indicator, In ZÌ, using different subsamples of our 
data set. Figure 5 displays the p-values of augmented Dickey~Fuller tests for the loga- 
rithmic indicator on different subsamples for all the three markets. The initial estimation 
period used in the unit root tests is £ = 1980/1,..., 1990/1. After this, the sample size is 
expanded by one observation at a time and the test statistic is recomputed for each sub- 
sample (t = 1980/1 to 1990/2, £ = 1980/1 to 1990/3, etc.). The results in Fig. 5, where 
low p-values (e.g., below 5%) indicate the rejection of the unit root null, confirm the sta- 
tionarity of the log indicator in the US over all the considered subsamples. In the UK and 
Germany the indicator is fairly stationary until the end of 2001, since the p-values are 
below 10% over most of the subsamples. Figures 3—4 show that the rapid decline of the 
log indicator towards the end of year 2001 causes the p-values of the ADF tests to in- 
crease to over 10% and 20% in the UK and Germany, respectively. Since, the indicator 
appears fairly stationary over most of the subsamples in all the markets, thus implying 
cointegration between the time series, we estimate VEC models to test whether the indi- 
cator can be used to explain the observed variations in the time series and especially in 
equity prices. To study how the recent market developments affect the resulting parameter 
estimates, we analyze all the estimation results using two different samples. The first sub- 
sample covers observations before the latest market turmoil, from 1980/1 to 2000/4, where 
the indicator appears very stationary, and the second sample uses the whole dataset, 1980/1 
to 2003/4. 
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4. Statistical analysis of the Fed model 


To test the predictive ability of the indicator, we estimate a Vector Equilibrium Correc- 
tion (VEC) model popularized by Engle and Granger (1987) and Johansen (1995) for the 
vector process 


In Y, 
Xt = In P; 
In E, 


This approach is similar to the Vector autoregression (VAR) approach used in Campbell 
and Shiller (1988), and it has similar advantages. A VEC model is obtained from a VAR 
model by adding an “equilibrium correction” term to it. It is a linear time series model of 
the form 
k 
Ax, =c +) AjAx-i +0(B'x-1-M) +6, € ~ NO, E), (2) 


i=l 


where c € RÌ, A; € R3*3, B e R?”!, u e R', a e R?* and X e R?” is the covariance 
matrix for the normally distributed random innovations. The third term on the right takes 
into account the long-term behavior of x; around statistical equilibria described by the 
linear equations f’x = yz. It is assumed that, in the long run, 


E[p'x;)] = p, (3) 


and that if x, deviates from the equilibria (due to shocks in economic conditions) it will 
tend to move back to them. The matrix œ determines the speed of adjustment towards 
the equilibria. In this sense, VEC-models incorporate long-run equilibrium relationships 
(often derived from economic theory) with short-run dynamic characteristics deduced from 
historical data. 

We are interested in whether or not the indicator 7; could serve as an equilibrium term 
in a VEC-model for xp, and whether it explains the variations in x;, and particularly on 
P, as the Fed model stipulates. As described in Section 3, we assume that the indicator 
is stationary, i.e., choose 8’ = [1, —1, 1], and estimate the remaining parameters of (2) 
from historical data. Although our approach resembles the method of Engle and Granger 
(1987) it is not a valid statistical approach. If there is actually no cointegration between the 
time series, i.e., the indicator is nonstationary, our model will be incorrectly specified and 
the t-statistic inference of the VEC model will be invalid. However, VEC model offers a 
natural way for modeling the phenomenon and testing the predictive ability of the indicator. 

Following the findings of the unit root tests in Section 3, we present the estimation 
results for two different samples. The first subsample covers observations from 1980/1 
to 2000/4 and the second the whole dataset, 1980/1 to 2003/4. The resulting parameter 
estimates and the corresponding t-values (in parenthesis) are given in Tables 2—4 for the 
three markets. Tables 2—4 also report as summary equation statistics the R-squared (R?) 
and adjusted R? values, equation standard errors (S.E.), p-values of F-statistic, values 
of the log likelihood function as well as mean and standard deviation of the dependent 
variables. The R? statistic measures the success of the regression in predicting the values 
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Table 2 
Parameter estimates, f-values and summary statistics for the United States 
1980/1-2000/4 1980/1-2003/4 
AlnP; AlnE; Alny, Aln P Aln E Aln Y; 
Aln P, —0.046 —0.071 0.140 0.040 —0.068 0.137 
(—0.413) (—1.724) (1.038) (0.393) (1.801) (1.176) 
AlnE;_ 0.196 0.504 —0.008 0.148 0.608 —0.149 
(0.735) (5.168) (—0.026) (0.635) (6.911) {—0.551) 
AlnY,-ı —0.124 0.000 0.146 —0.071 0.002 0.102 
(—1.284) (0.008) (1.257) (—0.782) (0.067) (0.976) 
In Z1 —0.123 0.025 —0.142 —0.139 0.009 —0.150 
(—2.311) (1.271) {—2.227) (—2.915) (0.481) (—2.723) 
c 0.618 —0.108 0.671 0.687 —0.034 0.710 
(2.421) (-1.154) (2.190) (2.999) (—0.394) (2.686) 
R? 0.126 0.285 0.069 0.132 0.353 0.085 
Adj. R? 0.081 0.248 0.021 0.093 0.324 0.044 
S.E. equation 0.072 0.026 0.086 0.078 0.030 0.091 
Prob(F -statistic) 0.032 0.000 0.231 0.013 0.000 0.091 
Log likelihood 102.000 184.700 87.014 108.447 200.220 94.883 
Mean dependent 0.031 0.017 —0.007 0.026 0.014 —0.008 
S.D. dependent 0.075 0.030 0.087 0.082 0.036 0.093 


Notes. The table reports parameter estimates and t-statistics (in parenthesis) from the VEC model for two different 
subsamples, 1980/1-2000/4 and 1980/1-2003/4. As summary equation statistics the table reports R-squared (R?) 
and adjusted R? values, equation standard errors (S.E.), p-values of F -statistic for testing the joint significance of 
the regressors, log likelihood function values as well as mean and standard deviation of the dependent variables. If 
the reported p-value of the F-statistic is less than a specified significance level, say 0.05, the null hypothesis that 
all equation coefficients are equal to zero is rejected. The datastream US market price index is used to represent 
the equity prices and the earnings are calculated from the reported P/E ratios. The bond yields correspond to the 
Datastream US ten-year government benchmark bond index. 


of the dependent variable and may be interpreted as the fraction of the variance of the 
dependent variable explained by the regressors. One problem with using R? as a measure of 
goodness of fit is that R? will never decrease as more regressors are added. The adjusted R? 
penalizes for the addition of regressors which do not contribute to the explanatory power 
of the model, can decrease as regressors are added, and for poorly fitting models, may be 
negative. The reported F-statistic p-values are from a test of the hypothesis that all of the 
coefficients (excluding the constant) in a regression are zero. If the p-value is less than a 
specified significance level, say 0.05, the null hypothesis that all equation coefficients are 
equal to zero is rejected. 

In the US the results are similar across the considered samples. The indicator, /;, has 
a Statistically significant negative effect on the equity prices, as the Fed model postulates, 
but it also enters the bond yield equation with a significant negative coefficient. Thus, 
the indicator pushes the equity prices and bond yields to the same direction, which can be 
interpreted as a two way correction to an observed disequilibrium between the current bond 
yields and equity earnings. The changes in log earnings are well explained by its lagged 
values which account for a high proportion of the variation in the time series, as is apparent 
from the relatively high R? for the earnings equation. This is in line with the findings of 
Campbell and Shiller (1988) for predicting equity earnings. 
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Table 3 
Parameter estimates, t-values and summary statistics for the United Kingdom 
1980/1-2000/4 1980/1—2003/4 
Aln P; Aln Er AlnY,; âln A; Aln E; Aln Y; 
Aln P; —0.129 0.057 0.192 —0.036 0.043 0.179 
(—1.121) (0.811) (1.724) (—0.328) (0.715) (1.897) 
Aln E; —0.079 0.202 0.152 0.047 0.176 0.160 
(—0.438) (1.841) (0.870) (0.255) (1.734) (1.012) 
Aln¥,_} —0.132 0.042 0.127 —0.142 0.043 0.104 
(—1.028) (0.541) (1.026) (—1.112) (0.611) (0.941) 
Inl —0.138 0.079 —0.059 —0.042 0.049 —0.067 
(—2.296) (2.166) (—1.019) (—0.925) (1.921) (—1.709) 
c 0.700 —0.370 0.266 0.224 —0.220 0.304 
(2.410) (—2.096) (0.947) (1.026) (-1.831) (1.622) 
R? 0.107 0.099 0.055 0.035 0.086 0.065 
Adj. R? 0.061 0.052 0.006 —0.009 0.045 0.023 
S.E. equation 0.078 0.048 0.076 0.086 0.047 0.074 
Prob(F -statistic) 0.065 0.088 0.355 0.529 0.088 0.193 
Log likelihood 95.029 135.888 97.838 99.687 155.617 113.833 
Mean dependent 0.031 0.016 —0.012 0.024 0.014 —0.011 
S.D. dependent 0.081 0.049 0.076 0.086 0.049 0.075 


Notes. The table reports parameter estimates and ż -statistics (in parenthesis) from the VEC model for two different 
subsamples, 1980/1-2000/4 and 1980/1-2003/4. As summary equation statistics the table reports R-squared (R2) 
and adjusted RÊ values, equation standard errors (S.E.), p-values of F -statistic for testing the joint significance of 
the regressors, log likelihood function values as well as mean and standard deviation of the dependent variables. 
If the reported p-value of the F-statistic is less than a specified significance level, say 0.05, the null hypothesis 
that all equation coefficients are equal to zero is rejected. The datastream UK market price index is used to present 
the equity prices and the earnings are calculated from the reported P/E ratios. The bond yields correspond to the 
Datastream UK ten-year government benchmark bond index. 


In the UK, the indicator is the only significant regressor explaining the variations of 
the equity prices and earnings during the first considered sample. However, this effect is 
alleviated when the whole dataset is used for estimation. The model does not explain the 
observed variations in the time series very well, which is highlighted by the low adjusted 
R? and high p-values of the F-statistics, see Table 3. 

In Germany, the indicator is again the most significant regressor in the model, by 
entering the earnings and bond yield equations with significant coefficients in the 1980/1- 
2000/4 sample, and the bond yield equation in the full sample. The earnings’ fluctuations 
are explained by the changes in equity prices and the bond yields are strongly autocorre- 
lated in both samples. 

Table 5 reports multivariate residual test statistics for the regressions. The tests reveal 
no autocorrelation or heteroskedasticity problems in the residual time series. However, the 
normality assumption of the residuals is rejected at a 1% significance level in the first 
subsample of the UK data and for the whole sample the residuals seem to be non-normally 
distributed in all the three countries. 

The indicator is able to explain some of the observed variations in the time series in all 
three markets. Especially, in the US and UK the indicator predicts the changes in equity 
prices, but it also appears as a significant regressor for bond yields in the US and Germany 
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Table 4 
Parameter estimates, t-values and summary statistics for Germany 
1980/1-2000/4 1980/1-2003/4 
AlnP; Aln E; Alny; AnP; Aln E; Alny; 
Aln P, 0.045 0.161 0.040 0.060 0.215 0.046 
(0.395) (1.556) (0.552) (0.540) (2.596) (0.732) 
Alh E; 0.031 0.156 —0.076 0.093 0.087 —0.028 
(0.254) (1.391) (—0.969) (0.685) (0.859) (—0.363) 
AlnyY;_) —0.256 —0.105 0.272 —0.291 —0.078 0.224 
(—1.617) (—0.730) (2.691) (— 1.638) (—0.588) (2.234) 
Infy-} —0.111 0.113 —0.136 —0.037 0.031 —0.052 
(—1.814) (2.017) (—3.479) (—0.848) (0.962) (—2.140) 
c 0.549 —0.526 0.639 0.184 —0.137 0.239 
(1.888) (—1.990) (3.456) (0.909) (—0.906) (2.092) 
R2 0.093 0.118 0.178 0.052 0.112 0.086 
Adj, R? 0.046 0.072 0.135 0.009 0.072 0.045 
S.E. equation 0.095 0.087 0.061 0.112 0.084 0.063 
Prob( F -statistic) 0.105 0.044 0.004 0.312 0.031 0.088 
Log likelihood 78.966 86.809 116.036 74.680 101.851 128.508 
Mean dependent 0.025 0.014 —0.006 0.018 0.014 ~—0.007 
S.D. dependent 0.098 0.090 0.065 0.113 0.087 0.065 


Notes. The table reports parameter estimates and r-statistics (in parenthesis) from the VEC model for two different 
subsamples, 1980/1-2000/4 and 1980/1-2003/4. As summary equation statistics the table reports R-squared ( R?) 
and adjusted R? values, equation standard errors (S.E.), p-values of F-statistic for testing the joint significance of 
the regressors, log likelihood function values as well as mean and standard deviation of the dependent variables. 
If the reported p-value of the F-statistic is less than a specified significance level, say 0.05, the null hypothesis 
that all equation coefficients are equal to zero is rejected. The datastream Germany market price index is used to 
present the equity prices and the earnings are calculated from the reported P/E ratios. The bond yields correspond 
to the Datastream Germany ten-year government benchmark bond index. 


Table 5 
Multivariate residual test statistics 

1980/1-2000/4 1980/1-2003/4 

US UK GER US UK GER 
Autocorrelation LM(4) 0.943 0.483 0.631 0.628 0.812 0.519 
Normality 0.096 0.001 0.014 0.003 0.000 0.004 
Heteroskedasticity 0.243 0.444 0.286 0.148 0.726 0.604 


Notes. The table reports multivariate residual test statistics for the regressions. The numbers are p-values of the 
statistics in the first column. The Lagrange multiplier (LM) test reports the multivariate test statistics for residual 
autocorrelation up to the specified order, h = 4. Under the null hypothesis of no serial correlation of order 4, the 
LM statistic is asymptotically x? distributed with k? degrees of freedom (Johansen, 1995), The normality test 
reports the multivariate extensions of the Jarque—Bera residual test (Doornik and Hansen, 1994), which compares 
the third and fourth moments of the residuals to those from the normal distribution. The reported value is the 
probability that a Jarque—Bera statistic exceeds (in absolute value) the observed value under the null hypothesis. 
A small probability value leads to the rejection of the null hypothesis of a normal distribution. Heteroskedasticity 
test is the extension of White’s (1980) test to systems of equations as discussed, e.g., by Doornik (1995). The 
test regression is run by regressing each cross product of the residuals on the cross products of the regressors and 
testing the joint significance of the regression. Under the null of no heteroskedasticity, the non-constant regressors 
should not be jointly significant, see Doornik (1995). 
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as well as for earnings in the UK and Germany. However, the extent of its forecasting 
power is somewhat questionable due to the low adjusted R? values reported in Tables 2-4. 
To further analyze the forecast ability of the indicator we compare the developed VEC 
model against VAR models in out-of-sample forecast tests in the next section. 


5. Forecast tests with the Fed model 


We compare the predictive ability of the VEC-models in out-of-sample tests against 
VAR-models for differences. The VAR-models are obtained by dropping the equilibrium 
correction term (the FED-indicator), so the focus is on the predictive ability of the log in- 
dicator. The models’ forecast performance is tested in a 1-step ahead forecast comparison 
using recursive estimation, where the models are re-estimated each time a new prediction 
has to be constructed. Initially, the models are estimated using a data sample t = 1980/1 
to 1989/4, and a 1-step ahead forecast is constructed using the estimated parameters. After 
this, the estimation period is expanded by one observation at a time (i.e., £ = 1980/1 to 
1990/1, £ = 1980/1 to 1990/2, etc.) and a new I-step prediction is constructed using the 
re-estimated parameters. We use the root mean squared forecast error (RMSE) (see, e.g., 
Makridakis et al., 1998) as a criterion for measuring the forecast accuracy of the differ- 
ent models. The VAR and VEC models’ RMSEs on two different subsamples, 1990/1 to 
2000/4 and 1990/1 to 2003/4, are given in Table 6. 

In the first subsample, 1990/1 to 2000/4, the VEC models outperform the VAR models 
by producing smaller forecast errors for the log differences of all the forecasted time series 
in all the three markets. The same general observation applies also in the second sample, 
1990/1 to 2003/4, although the prediction errors generated by the models are slightly larger 
when the observations from 2001/1 to 2003/4 are included in the forecast test. Apart from 
forecasting the changes in the German log earnings, VEC models outperform VAR models 
by generating smaller or equal forecast errors for all the time series. It appears that the in- 
dicator has predictive power in forecasting quarterly differences of equity prices, earnings 


Table 6 

Root mean squared errors for 1-step ahead forecasts 

Forecasted US UK GER 

variable VAR VEC VAR VEC VAR VEC 
Forecast sample 1990/1-2000/4 

AlnP 0.068 0.067 0.072 0.068 0.098 0.094 
AlnE 0.025 0.024 0.039 0.038 0.073 0.073 
AlnY 0.082 0.079 0.074 0.073 0.064 0.061 
Forecast sample 1990/1-2003/4 

Aln P 0.080 0.077 0.082 0.082 0.120 0.117 
AINE 0.030 0.029 0.040 0.039 0.068 0.071 
Alny 0.091 0.086 0.073 0.070 0.063 0.063 


Notes. The table reports 1-step ahead RMSEs of VAR and VEC models for two different out-of-sample forecast 
periods. The model parameters are re-estimated every time a new prediction has to be constructed. 
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and bond yields, even though the reductions in the forecast errors produced by the indicator 
are somewhat limited. 


6. Conclusion 


This paper developed a logarithmic Vector Equilibrium Correction version of the Fed 
model. The results of Section 4 show that the model is statistically significant in explaining 
variations in the logarithmic values of stock prices, earnings and bond yields. Moreover, 
in the forecast tests of Section 5, the VEC-model outperforms the VAR-model obtained by 
dropping the log indicator from the VEC-model (and re-estimating the parameters). These 
results support the earlier findings of Ziemba and Schwartz (1991), Berge and Ziemba 
(2003), Ziemba (2003) and Campbell and Vuolteenaho (2004). Consistent with Berge and 
Ziemba (2003), our results indicate that the Fed model is more successful in the US than 
in the other markets. Our results do not validate the logic behind the Fed model but they 
do show that during 1980-2003 the Fed model has had some predictive power in the US, 
the UK and German markets. 
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Chapter 20 


The Predictive Ability of the 
Bond-Stock Earnings Yield 
Differential Model 


KLAUS BERGE, GIORGIO CONSIGLI, AND WILLIAM T. ZIEMBA 


he Federal Reserve (Fed) model 

provides a framework for discussing 

stock market over- and undervalu- 

ation. It was introduced by market 
practitioners after Alan Greenspan’s speech on 
the market’s irrational exuberance in 
November 1996 as an attempt to understand 
and predict variations in the equity risk pre- 
mium (ERP). The model relates the yield on 
stocks (measured by the ratio of earnings to 
stock prices) to the yield on nominal Treasury 
bonds. The theory behind the Fed model is 
that an optimal asset allocation between stocks 
and bonds is related to their relative yields and 
when the bond yield is too high, a market 
adjustment is needed resulting in a shift out of 
stocks into bonds. Ifthe adjustment is large, it 
causes an equity market correction (a decline 
of 10% within one year); hence, there is a 
short-term negative ERP. The model predicted 
the 1987 U.S., 1990 Japan, 2000 U.S., and 
2002 U.S. corrections. 

A similar valuation model using the 30- 
year long bond rate and the equity yield dif- 
ference as the underlying measure of relative 
equity market valuation was suggested by 
Ziemba and Schwartz [1991] (see also Ziemba 
[2003], and Koivu, Pennanen, and Ziemba 
[2005}). The partial predictability of stock 
returns has been analyzed and confirmed by 
Fama and French [1988b], and Poterba and 
Summers [1988]. The ability of financial and 
accounting variables to predict individual stock 
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prices and stock indices has been studied by 
Campbell [1987, 1990, 1993]; Campbell and 
Shiller [1988]; Campbell and Yogo [2006]; 
Fama and French [1988a, 1989]; Goetzmann 
and Ibbotson [2006]; Jacobs and Levy [1988]; 
Lakonishok, Schleifer, and Vishny [1994]; Polk, 
Thompson, and Vuolteenaho [2006], and 
Ziemba and Schwartz [1991, 2000]. 

The Fed model has been successful in 
predicting market turns, but in spite of its 
empirical success and simplicity, the model has 
been criticized. First, it does not consider the 
role played by time-varying risk premiums in 
the portfolio selection process, yet it does con- 
sider a risk-free government interest rate as the 
discount factor of future earnings. More seri- 
ously, the inflation illusion (the possible impact 
of inflation expectations on the stock market) 
as suggested by Modigliani and Cohn [1979] 
is not taken into consideration. Second, the 
model assumes the comparability of an earn- 
ings-to-price ratio (EP), a real quantity, to a 
nominal, bond-induced interest rate (Campbell 
and Vuolteenaho [2004]). Asness [2000, 2003], 
and Ritter and Warr [2002] discussed these 
issues. Consigli, MacLean, Zhao and Ziemba 
[2006] propose a stochastic model of equity 
returns based on an extension of the Fed model 
inclusive of a risk premium in which market 
corrections are endogenously generated by the 
bond-stock yield difference. The model 
accommodates both cases of prolonged yield 
deviations leading to a long series of small 
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declines in the equity market and the case, which is pecu- 
liar to recent speculative bubbles, of a series of corrections 
over limited time periods. The inclusion of the yield dif- 
ferential as a key driver of the market correction process 
is tested and the model is validated with market data. 

The next, or second, section discusses the bond—stock 
valuation model and shows its original application to the 
October 1987 correction. Hence, the ERP is not con- 
stant and thus is partially predictable. The third section 
discusses overvaluation and possible forthcoming correc- 
tions. The fourth section discusses empirical evidence from 
the U.S., U.K., Japan, Germany, and Canada. The fifth 
section analyzes endogenous instability, risk premiums, 
and the nominal-versus-real variable issue. The last section 
concludes. 


VALUATION MODEL, EQUITY YIELD, 
AND 10-YEAR INTEREST RATE 


The Fed model in its original 1996 form states the 
dependence of a fair stock price level, p(#), at time t to 
the expected earnings, E(#), and the 10-year Treasury 
rate, r(t). Earnings expectations are fully incorporated in 
prices and discounted at the current 10-year risk-free 
rate for t = 1, 2, ..., 


a(t) = FO 
P(t) = 0) (1) 


Earnings per share, y(t), are computed by equity 
investors as the expected earnings for a given unit invest- 
ment in the stock market with equity shares, S(t), 


y(t) = T (2) 


then S()= FR There is a direct relationship between 
the equity yield in Equation (2) and the risk-free rate in 
Equation (1). The ratio of the current market value to 
the theoretical value is the bond-stock yield ratio, BSYR(t), 


rt) _ 
z0 |= BSYR(t) 


) (3) 
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The bond-stock earnings yield differential, 
BSEYD(t) = r(t) — y(t), is related to the (random) valua- 
tion measure and the equity yield, 


S()__, _ BSEYD() 


4 
O x0) = 


BSEYD(t)=[ BSYR(t)—1]y(¢) (4b) 


The differential reflects the difference between the 
current market value and its theoretical value. A more 
theoretically sound motivation for the predictive ability 
of the BSEYD is the basic Gordon formula, where EP is 
the forward earnings yield (which Schwartz and Ziemba 
[2000] show is the best predictor of at least individual 
Japanese stock prices), 

EP — nominal yield = equity risk premium — real 
growth — inflation, so the BSEYD can be used as a proxy 
for the unobservable right-side economic variables. 

For a given equity yield, the BSEYD and the BSYR 
can be used to identify zones of under- and overvaluation 
and to forecast possible forthcoming market adjustments. 
Under the Fed model assumptions, we expect the market, 
even with prolonged deviations, to fluctuate around its 
theoretical value and, thus from time to time, the BSYR. 
should converge to one and the BSEYD to zero. Alter- 
natively, a market risk premium should be accounted for 
and future earnings accordingly discounted by an interest 
rate above the prevailing risk-free rate (10-year maturity 
rates in the canonical Fed model assumption). 

A classic example of the use of this model is the 
stock market correction in October 1987 illustrated in 
Exhibit 1. It is from this application that Ziemba’s Yamaichi 
Research Institute study group in 1988 formulated the 
BSEYD measure described in Ziemba and Schwartz 
[1991], and Ziemba [2003]. The danger zone was entered 
in May 1987 and the correction occurred in October, 
four months later. During June, July, and August, investors 
kept rebalancing their portfolios from the bond to the 
equity market (MSCI TRI + 13.87% over the quarter). 
The equity market then fell -31.80% in the following 
quarter (September to November 1987) with the majority 
of the decline occurring in October. In order to validate 
the approach, Berge and Ziemba [2006] used cight dif- 
ferent strategies to test the predictive ability of the BBEYD 
or, equivalently, the BSYR (see Exhibit 2). The strategies 
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EXHIBIT 1 
Behavior of BSEYD in U.S. Around October 1987 


Stock Market Correction 
10Y T-Bond EP MSCI TRI 
Date Yield Ratio BSEYD Return in % 
Sep-86 7.45 7.19 0.26 -8.50 
Oct-86 7.43 6.81 0.62 5.68 
Nov-86 7.25 6.78 0.47 2:35 
Dec-86 FANI 7.07 0.04 -2.52 
Jan-87 7.08 6.26 0.82 13.28 
Feb-87 7.25 5.96 1.29 3.59 
Mar-87 T25 5.76 1.49 2.70 
Apr-87 8.02 5.8 2.22 -0.90 
May-87 8.61 5.85 2.76 0.56 
Jun-87 8.40 5.59 2.81 5.18 
Jul-87 8.45 5.36 3.09 4.75 
Aug-87 8.76 4.88 3.88 3.94 
Sep-87 9.42 5.03 4,39 -2.13 
Oct-87 9.52 6.35 3.17 -21.22 
Nov-87 8.86 7.38 1.48 -8.45 
Dec-87 8.99 6.87 2.12 7.04 
Jan-88 8.67 6.66 2.01 4.28 
Feb-88 8.21 6.86 1.35 4.19 
Mar-88 8.37 7.14 1.23 -3.35 
Apr-88 8.72 7.08 1.64 0.92 
May-88 9.09 751 1.58 0.86 
Jun-88 8.92 7.29 1.63 4.65 
Jul-88 9.06 7.32 1.74 -0.31 
Aug-88 9.26 8.77 0.49 -3.35 


Source: Berge and Ziemba [2006]. 


These 60 or 120 observations, respectively, inclu- 
ding the value of the current month, define the threshold 
levels for the end of that month. As time evolves, the 
most recent monthly value is added and the oldest value 
is omitted. Moving intervals diminish the impact of 
changes in the regulatory framework of the economy 
on EP ratios over time, since high past values of the 
BSEYD are eventually excluded. For the historical values 
of the BSEYD, two different distributional concepts are 
used to define the predetermined exit and entry threshold 
levels: normal and historical. In the case of normality, the 
mean and standard deviation of the historical range of 
BSEYDs are 


iti 
BSEYD.a =- $, BSEYD, 6) 


j=i-d+1 


and 


Eag — , 
Spsevn,, “ft >, (BSEYD, ~BSEYD,.)* (6) 


d= j=t-d+1 


respectively, where d is the length of the interval consid- 
ered. The exit threshold level is 


TLOUN? = BSEYD,a +z, . 5 (7) 
are defined over either 5- or 10-year time-data intervals, aes Bed 
under normal or historical return distribution assump- dth hreshold level i 
tions, and different fractile levels on such distributions for ane Hae entry. PESTO Evens 
exiting and entering the equity market. The threshold h.ND _ ROEY 
ees TL, = BSEYD a + Zy - Sgse (8) 
levels were specified for the BSEYD measure. t4 @," BSEYD, q 
EXHIBIT 2 
Strategies Used to Evaluate the Robustness (Predictive Ability) of BSEYD-based Strategies 
= Length of Determining Fractile for Exit Fractile for Entry 
Strategy Interval (Years) Distribution Type Threshold Level (a,) Threshold Level (a,) 
T 5 Historical 90% ~ 80% 
2 5 Historical 90% 70% 
3 5 Normal 95% 85% 
4 5 Normal 95% 75% 
5 10 Historical 95% 85% 
6 10 Historical 95% 75% 
7 10 Normal 90% 85% 
8 10 Normal 90% 80% 


Source: Berge and Ziemba [2006]. 
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where Z on and Z yz are the œ- and a,-fractiles, respec- 
tively, of the standard normal probability distribution. The 
second concept calculates the œ- and œ, fractiles of the 
historical distribution. The a-fractile of d observations is 


Ka = Ky + erry Xu) + (I> @) 0) 
where k = d- œ and xq, is the k- smallest value of the d 
observations starting at the end of month (t— d + 1) and 
ending at the end of month ¢. 

The strategies are evaluated over the 1979-2005 
period in five equity markets—the U.S., U.K., Germany, 
Canada, and Japan. The stock market index data are from 
Morgan Stanley Capital International (MSCI) and utilize 
a uniform methodology across all markets. The total return 
indices (TRI) are calculated with gross dividends (before 
taxes) reinvested to estimate the total return on the market 
that would be achieved by reinvesting one-twelfth of the 
annual dividend reported at every month-end. The price- 
to-earnings (PE) ratios provided by MSCI refer to the 
companies in the respective MSCI TRI. They are the 
sum of the weighted latest share prices in local currency 
divided by the sum of the weighted most recent trailing 
12-month earnings (losses) per share. MSCI does not esti- 
mate future earnings, so only past earnings are included 
in their data. Actual earnings are entered in the database 
as soon as they are reported at an adequately detailed level. 
The advantage of the MSCI indices over other indices 
is that even when the overall market experiences losses, 
PE ratios, in this case negative, can be calculated. For cash 


EXHIBIT 3 


equivalents, the yield on 3-month T-bills is used. Data 
for 3-month T-bills are from Global Financial Data (GFD) 
and Thomson Financial Datastream. 

Data on 10-year government bond yields come from 
various sources. The U.S. data are from the Federal 
Reserve. Yields are of constant maturity interpolated from 
the daily yield curve. Data for Germany were supplied 
by the Deutsche Bérse AG and are calculated similarly as 
for U.S. data. Data for the U.K. and Japan come from 
GFM. These yields refer to government bonds with matu- 
rities closest to 10 years. Data for Japan are available begin- 
ning in May 1980. Japanese T bill data were not updated 
monthly until December 1981. Canadian government 
bond yields were obtained from the Bank of Canada. 

For the first strategy, five years of monthly obser- 
vations are used prior to 1979 to estimate the historical 
(or frequency) distribution of the 90% and 80% fractiles. 
Every month the data includes the realized monthly 
return and discards the oldest observation to keep the 
five-year window constant, Every strategy has well- 
defined equity market entry and exit values—once in, 
the strategy suggests 100% in equities, once out 0% (i.e., 
a 0-1 stock—bond strategy). The appendix contains results 
for the five markets for the 35-year period 1970-2005. 
Every fractile pair defines the market danger zone used 
in defining optimal investment strategies and portfolio 
rebalancing. 

Strategy 6 was preferable in terms of total return 
over the 25-year period 1980-2005 in the U.S. market. 
Exhibit 3 shows exiting and entering months according 


Required Exit and Re-Entry Fractiles for Strategy 6 for Optimal Performance 


Value of BSEYD ~ 
(Previous Month)/ Value of BSEYD/ 
Corresponding Corresponding 
Correction Required Exit Required Re-Entry 
Number Start Date F. ractile . End Date Fractile 
3 April 1981 1.74 / 97.5% September 1981 2.04 / 96.5% 
4 December 198! 1.26 / 87.5% July 1982 0.00 / 65.0% 
a September 1987 3.88 / 99.0% November 1987 1.48 / 55.0% 
6 June 1990 2.31 / 77.0% October 1990 1.40 / 41.0% 
7 July 1998 1.83 / 62.0% August 1998 0.96 / 25.0% 
8 April 2000 2.99 / 94.0% September 2001 1.11 / 18.5% 
9 April 2002 2.39 / 84.0% September 2002 -0.32 / 0.0% 


Note: Corrections 1 and 2 predate April 1981. See Exhibit Al in the appendix. 
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to these thresholds and the BSEYD signal in the previous 
month. 

Exhibits A1—-A10 in the appendix show the fol- 
lowing for each of the five countries: stock market cor- 
rections (declines of more than 10%) and their durations, 
as well as the results of the strategies in terms of percent 
of time in the stock market, mean log return, standard 
deviation, Sharpe ratio, mean excess return, and terminal 
value. All criteria are compared to actual stock market 
total returns from 1975-2005 for Strategies 1 to 4 and 
from 1980-2005 for Strategies 5 to 8. Dimson, Marsh, and 
Staunton [2006, 2007] presented returns over more than 
100 years, but did not discuss partially predictive invest- 
ment strategies as did Berge and Ziemba [2006]. 

The U.S. had 9 corrections from 1975 to 2005. The 
terminal values, Sharpe ratios, and so forth all suggest that 
the strategies added considerable value. The main test is, 
are you better off being out of the stock market when 
the BSEYD measure suggests you should be? For example, 
$100 grew to $4,650 in the stock market and from $8,480 
to $10,635 with Strategies 1 to 4 for the period 
1975-2005. In addition, the portfolio variance was less 
because about 15% to 20% of the time one is out of the 
stock market. The period 1980-2005 had similar posi- 
tive results for the various strategies. 

Japan had 7 corrections including the 31-month 
56.2% decline starting in January 1990, Ziemba [2003], 
and Ziemba and Schwartz [1991] provided calculations 
showing that this correction was the most in the danger 
zone of any of the measures in the 42-year period 
1948-1990, Ziemba and Schwartz found that every time 
the BSEYD measure was in the danger zone during the 
period 1948-1988, there was a correction of at least 10% 
from the current stock index level. The BSEYD measure 
produced no misses (12/12), but only 12 of the 20 cor- 
rections from 1948 to 1988 resulted from its predictive 
ability. The other 8 corrections were instigated by other 
causes. The results for Japan, as for the U.S., are good for 
the measure: ¥100 grows to ¥213.88 with the market 
index from 1985 to 2005, but to ¥455.72 to ¥498.62 
with Strategies 1 to 4. Strategies 5 to 8 also had good pre- 
dictive results for the period 1990-2005. 

The results for the other three countries were not 
as good in terms of final wealth levels over the period 
1975-2005. All the strategies, however, were superior to 
a buy and hold strategy in terms of Sharpe ratios and in 
the final wealth levels during the period 1980-2005. For 
the U.K., most of the strategies outperformed the stock 
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market return of the buy and hold strategy, but some 
strategies did underperform and the outperformance of 
the winning strategies was minimal. 

The U.K. had 15 corrections, mostly of short dura- 
tion, but the 17.9% decline of September 2000 lasted 
13 months and the April 2002 decline of 31.6% lasted 
10 months. 

The results in Canada were similar to those of the 
US. with 5 of its 12 corrections having declines of over 
20%, including a 41.2% decline in September 2000 over 
13 months. With 13 corrections, Germany also had sim- 
ilar results with the March 2000 decline of —-44% lasting 
19 months and the April 2002 decline of —53.9% spread 
over 12 months. 

The basic conclusion is that the bond-stock model 
has had positive predictive ability, working as well as it 
has in these five countries, especially in the 1980-2005 
period, with the results in the U.S. and Japan the best. 
Portfolio strategies based on the bond~stock relative val- 
uation measure have produced superior returns relative 
to a buy and hold strategy. 


OVERVALUATION AND FORTHCOMING 
CORRECTIONS 


The evidence shows that the effectiveness of the 
bond-stock yield difference measure varies from market to 
market and additional indicators may be required to generate 
robust strategies across markets and times. We focus on the 
properties of the yield ratio as a signal of forthcoming market 
corrections, thus suggesting an immediate exit from the 
equity market, but investors will follow different strategies. 

A critical view on the role of the bond-stock yield 
differential as a predictive variable of future market adjust- 
ments depends on the market evidence. Consider the fol- 
lowing alternative scenarios from which an indirect 
economic validation of the approach can be derived: 


1. The stock market is overvalued according to the 
bond-stock yield differential and a subsequent 
market correction is observed; this evidence sup- 
ports the suggested approach. 

2. Market corrections are observed even if the yield 
differential is within the no-danger zone. 

3. Even if the yield differential remains for an extended 
period in the danger zone, no market corrections occur. 

4. Market corrections do not occur for an extended 
period of time and the yield differential is around zero. 
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The first and fourth market conditions support the 
BSEYD model and suggest that a large difference between 
the earnings per share at an aggregate level in the economy 
and the 10-year bond rate is a sufficient condition for a 
subsequent correction. 

The second and third conditions suggest that 
investors act independently and do not rely on the 
bond-equity yield differentials to determine their port- 
folio strategies and the strategic bond—equity portfolio 
composition. If corrections are observed when the yield 
differential is relatively low, this implies that the yield dif- 
ferential may very well provide a sufficient, but not nec- 
essary, condition for corrections. In this case, other 
variables are determining the adjustment. Finally the 
absence of corrections in the presence of persistently high 
differentials would contradict the sufficient condition. 

We directly test the short-term predictive power of 
the yield spread with respect to market returns, verifying 
the market response to diverging bond and stock yields 
observed in previous periods. A BSEYD in the upper 
tail of the frequency distribution should, in general, antic- 
ipate a market correction of varying intensity. To docu- 
ment short-term predictability we use a simple predictive 
regression model based on monthly returns in the equity 
market and one-, two-, and three-month lagged yield 
spreads. The bivariate prediction equation for each market k 
(U.S., U.K., Japan, Germany, and Canada) is 


P,(t)= 4, +6, [y(t- h)2 zim +e Oh 
=1,2,3 k=1,...,5 (10) 


where 

P,(t) is the month-to-month log equity return for 
market k, over (t—1, f, namely In (23) > ¥y(t— h) is the 
BSEYD or yield spread at time t — h one, two, and three 
months earlier, and Zh oe is the 90th percentile of the 
yield spread historical distribution estimated at time t — h 
over the market history (t— h — H, t— h), where H defines 
a 15-year data history. 

For every f, starting with the period January 1970 
to December 1984, we estimate the frequency distribu- 
tion on this window, first identifying the 90% percentile 
and then moving forward one month, then identifying 
the associated 90% percentile, and repeating with a con- 
stant time window. 
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Once such a series of critical values has been iden- 
tified, we test for t = Jan1985,..., Dec2004, to determine 
if the current yield spread falls above the current critical 
value, namely, y, (t — h) Szk oog: Then, conditional on 
the yield spread excess, we determine the market return 
one, two, and three months ahead, and identify the resid~ 
uals for each month and market. In this way, we estimate 
directly from the tail of the distribution the market-specific 
coefficients a,, b, over the 20-year period. 

This procedure tests the market return dependence 
on the one-, two-, and three-month lagged yield spread, 
conditional on such spread being above the 90th yield 
percentile. A statistically significant relationship between 
p,{(t)and y,(t-h), h = 1, 2, 3 for market k is not sufficient 
to assess the predictive power of the yield spread with 
respect to the equity return. A negative coefficient b, is 
needed with residuals normally distributed with mean 0 
and variance e,(t) € N (0, 07). Exhibit 4 reports the 
results. (See also Exhibit 5 for the one-month predictive 
regressions in the U.S.) The second and third columns of 
Exhibit 4 give the means and standard deviations of 
monthly returns conditional on the yield spread being 
inside the time varying 10% upper tail. The regression 
results and mean and variance of the computed residuals 
are given for the selected subsamples, and depend on 
market volatility. 

The yield spread has some short-term predictive 
power in the U.S. and U.K. markets, but very little in the 
Japan market and none in the German market. We are 
not testing the effectiveness of the yield spread as the dri- 
ving variable of a strategic portfolio allocation, but only 
the dependence of short-term market corrections (not 
beyond three months) on previously observed bond-stock 
yield excesses. 


High Yield Difference with Corrections 


The BSEYD model relies on the evidence that, at 
current market values, improved expectations of com- 
pany earnings drive up price yields making equity invest- 
ments more attractive than fixed-income investments. 
This change in yields pushes up the demand for stocks and 
triggers a portfolio rebalancing from bonds to stocks. 
In turn, this shift pushes stock yields down toward their 
long-term equilibrium value which is typically inclusive 
of a small ERP over the risk-free rate. If, however, earn- 
ings expectations keep rising, stock prices may keep rising 
as well. This drives the market dynamics in such a way 
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EXHIBIT 4 
One-, Two-, and Three-Month Predictive Power of BSEYD for Various Equity Markets, 1985-2005 
Jan 1985 -- Dec 2005 MSCI monthly data 
conditional | conditional residuals 
US Equity mean std dev variance mean 
ag 1| -0.008059506! 0.071475813) 0.002961, 0.000466996 
ag 2| -0.004050033) 0.074122486 
ag 3| -0.009414431) 0.071723591 
UK Equity 
ag 1| _-0.01566462| 0.089364691 a at 
ag 2| -0.028584323} 0.089782349 0.005599| 4.08603E-05 
ag 3| -0.008372955| 0.051777963) | 
JPN Equity 
ag 1| -0.032871424} 0.08874209 0.007375] 9.60305E-05} 
ag 2] -0.02053667; 0.091968716 
ag 3) -0.023069087) 0092978531 ] 
GER Equity l 
ag 1 -0.003929792| 0.060827994 0.003623] -0.000123067 
ag 2| -0.004137307| _0.060351908 
ag 3| -0.000313041 | 0.061393217| 
Note: No results are reported for Canada because no yield excesses were reported over the 20-year period from January 1985 to December 2005. 
EXHIBIT 5 
One-Month Predictive Regressions of BSEYD for Various Equity Markets, January 1985-December 2005 
US market — MSCI TRI UK market — MSCI UK TRI 
1 month predictive regression 1 month predictiva regression 
3 
yield spread at (t-1) yield spread (t-1) 
JPN Stock market - MSCI JP TRI GER stock market -- MSC] GER TRI 
1 month predictive regression 1 month predictive regression 
é 
i, 
2 
ws 
35 a a 5 ss 2 3 z 5 £ 7 
yield spread (1) yield spread @-1) 
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that the market price yield does not stabilize around a 
consensus long-term level. 

A condition of instability may emerge in this setting 
if the 10-year interest rate also increases. Several factors 
explain this possibility: 


1. The selling of 10-year T-notes to finance stock 
market investments. 

2. A shift of the yield curve induced by Fed policy in 
order to offset the inflationary impact on con- 
sumption from the stock market wealth effect. 

3. The possible overheating of the economy associated 
with improved earnings expectations, partly due to 
the persistence of market increases. 

4. An increased demand for long-term funding to 
finance equity investments. 


This last scenario, in spite of an assumed improve- 
ment in earnings expectations, generally induces a modest 
increase in the theoretical equilibrium stock market value, 
thus supporting a growing deviation of the actual market 
capitalization from the long-term theoretical value. 

Such a deviation will persist and fuel increasing for- 
ward instability until interest rate tightening is no longer 
perceived by the market as being sufficient to reduce the 
expected improvement in companies’ earnings. From a 
different viewpoint, investors initially begin rebalancing 
from bonds to equity then, relying on the positive market 
outlook, begin borrowing to move further up on the 
efficient frontier in order to maximize expected return. 
The subsequent market adjustment will largely be due to 
the sudden and substantial inversion of relative values. 
Under these circumstances, foreign investors will play a 
specific role; that is, in the presence of an increasing for- 
eign—domestic interest rate gap, foreign investors will be 
attracted to foreign equity markets to attempt to obtain 
superior returns and to gain from the exchange rate dif- 
ferential. Inflows from foreign markets will further hasten 
the departure of the market from its theoretical value. In 
this sequence of events, the 10-year Treasury rate will 
correspondingly deviate from the stock market yield. The 
1987 crisis and the comparatively smaller 1997 crisis are 
consistent with this observation. On October 17, 1987, 
the 10-year rate was 10.22%, increasing from 7.5% in 
March 1987. As shown in Exhibit 1, the BSEYD moved 
over the same period from 1.49 to 4.32 in September. A 
prolonged decrease of equity yields and increase of interest 
rates in a rapidly expanding economy may lead to a market 
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overvaluation and a subsequent market collapse when 
the BSEYD reaches a level unsustainable by the market. 


High Yield Difference without Corrections 


Can the BSEYD remain high on historical terms 
(above 2.03 which was the average between January 1, 
1985, and December 31, 2004) for an extended period 
of time when the market is relatively stable? Or, is it pos- 
sible to have a scenario in which the yield ratio is increasing 
and high, the ratio between actual and theoretical prices 
is increasing, and the equity market is increasing, without 
having a speculative bubble? 

Thus, in the case of stable earnings expectations and 
price yields, a reduction in market interest rates—due 
perhaps to a loose monetary policy by the Federal Reserve 
motivated by exogenous factors—to below the equity 
yield decrease will drive up the ratio, but without causing 
a bubble. Low interest rates will also limit the inflows 
from foreign equity markets and from currency revalua- 
tion. This was the situation in the U.S. market from 1991 
to 1992 and in 1999. Consider the 1999 period in which 
no corrections occurred; the BSEYD ratio remained above 
3.0 for a long period. The reason appears to be related to 
the fact that the market had a correction during the fall 
of 1998, was already regaining its pace, and a few months 
elapsed before the realization was made that the theoret- 
ical market value was stable and interest rates were low and, 
indeed, slightly decreasing. The bond-stock ratio increased 
during a period of low interest rates (below 6%) in the U.S. 
and increasing corporate earnings. If the equity yield is 
decreasing because of a sharply rising equity market and 
interest rates remain low or decreasing, a relatively high 
BSEYD can be associated with stable market expecta- 
tions. Then, a BSEYD in the danger zone is not sufficient 
to lead to a market correction. Inflation expectations play 
an additional role in market sentiment with decreasing 
interest rates conveying a positive sentiment on the market. 


Low Yield Difference with Corrections 


Finally, is a high BSEYD necessary to observe a cor- 
rection? Not always. Market corrections have been 
observed even when the bond-stock differential was low 
and stable; see Exhibit 4. Also, events like September 11, 
2001, can cause market corrections. In that case, the S&P 
500 fell by -14% in the week following the reopening of 
USS. markets after they had been closed for one week. 
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The case of market instability and a low BSEYD is 
crucial in assessing the validity of the indicators as valua- 
tion measures. One such example begins in the third quarter 
of 2002. After a smooth decrease of the BSEYD during July 
and August 2002, a sequence of large declines followed: 
—5,98% in the second week of September, —12.75% in the 
third week of September, and —6.49% in the first week of 
October. During this period, the BSEYD never reached 
the danger zone, remaining below the 2.85 level. The 10- 
year rate was stable, but began to decrease, falling below 
5% in October 2002. A common explanation is that the 
ratio had dramatically decreased from a May 2002 max- 
imum of 3.5 and the equity yield was on a positive trend 
at the time, anticipating the following market recovery. No 
market corrections have been recorded with a low BSEYD 
and an increasing equity market. 

In summary, the following aspects appear key to 
the ongoing debate regarding the robustness of the 
BSEYD as a tool for anticipating signals and thcir eco- 
nomic implications: 


1. The relative nature of the valuation measure implies 
that diverging bond and stock yields may lead to 
market corrections if the equity market has increa- 
sed rapidly over the preceding months due to 
bond-equity rebalancing and equity investments are 
funded through borrowing and incoming foreign 
capital, even with high and increasing interest rates. 
In this case, the degree of market internationaliza- 
tion may become relevant. A high BSEYD alone, 
however, is neither sufficient nor necessary to antic- 
ipate a market crisis. 

2. The stock market phase appears crucial. A high yield 
differential is consistent with either an increasing or 
decreasing equity market; in both cases, the market 
adjustment will depend on the relative dynamics of 
the 10-year rate and price yield, conditional on 
recent market performance. 

3. Finally, interest rates and inflationary expectations 
are relevant. Increasing, but relatively low, interest 
rates have a positive effect on market expectations 
and usually accompany a smooth convergence of 
equity prices to theoretical values, avoiding market 
corrections. 


Also a period of instability may be induced by an 


exogenous correction or crisis in a different market (i.e., 
independently of the local current yield ratio). 
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U.S. AND WORLDWIDE RESULTS 


We review the evidence in Berge and Ziemba [2006] 
from five equity markets. The second section and appendix 
show that being out of the market when the BSEYD 
measure indicates an investor should be out dominates 
buy and hold strategies in the U.S., U.K., Japan, Ger- 
many, and Canada during the period 1975-2005. Periods 
in the danger zone are identified by entry and exit sig- 
nals; the associated time periods can vary and the lack of 
correction occurrence is identified once the market is in 
the danger zone. Exhibit 6 shows that 34 corrections 
occurred while the yield differential was not in the danger 
zone; only 13 corrections were accurately called by the 
BSYED danger zone. 

Consider the first row of Exhibit 6. In the first 
column is a sequence of market corrections correctly 
anticipated by the yield differential. The second column 
shows the corrections that occurred when the BSEYD 
was not in the danger zone. During the 25-year history 
studied in the U.K., of 11 market corrections, only 1 
was anticipated by the BSEYD measure. Some of the 
recent corrections started in a foreign market and took 
on a domino effect as discussed further in the appendix. 
Exhibits 7 and 8 report the series of corrections observed 
in the US. and U.K. markets between 1980 and 2005. 
The overlapping corrections sequence, noted in bold in 
the exhibits, is only marginally due to internal 
bond-stock yield differentials. Before most of these cor- 
rections, the BSEYD was not in a danger zone. After 
the corrections, however, monetary authorities inter- 
vened, injecting liquidity into the markets or lowering 
interest rates in an attempt to limit the market's fall and 
its systemic consequences. As a result, the BSEYD was 
also reduced. 


EXHIBIT 6 

Evaluation of Forecasting Signals for Best- 
Performing BSEYD Strategy 6 in U.S., Germany, 
Canada, U.K., and Japan, 1980-2005 


BSEYD in Danger Zone BSEYD Not in Danger Zone 
(U.S., GER, CAN, U.K., JPN) (U.S., GER, CAN, U.K., JPN) 
Correction 
Occurred 3, 4, 3, 1,2 = 13 4, 8,7,10,5=34 
Correction 
Did Not Occur 2,3,1,3,0=9 3,4; 2, L112 


Source: Berge and Ziemba [2006]. 
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EXHIBIT 7 
Forecasting Signals and Corrections 


in the U.S. 1980-2005 
Correction Duration 
Number Start Date (Months) Decline (%) 
1 April 1981 6 -11.6 
2 December 1981 8 -11.1 
3 September 1987 3 -29.4 
4 June 1990 5 -14.1 
5 July 1998 2 -14.7 
6 April 2000 18 -31.2 
7 April 2002 6 -29.1 


Although the duration of the correction starting in April 2000 is longer 
than one year, it is considered a correction since the stock market dropped 
23.9% within the following 12 months. 


EXHIBIT 8 


Forecasting Signals and Corrections 
in the U.K. 1980-2005 


Correction Duration 
Number Start Date (Months) Decline (%) 
1 September 1981 1 -16.1 
2 October 1987 2 -33.4 
3 September 1989 2 -10.4 
4 January 1990 4 -12.3 
5 July 1990 3 -16.0 
6 June 1992 3 -14.9 
7 February 1994 5 -15.1 
8 May 1998 5 -14.3 
9 January 2000 2 -10.2 
10 September 2000 13 -17.9 
11 April 2002 10 -31.6 


Although the duration of the correction starting in September 2000 is longer 
than one year, it is considered a correction since the stock market dropped 
14.7% within 12 months between September 2000 and August 2001. 


The evidence on the predictive power of the 
BSEYD in the U.K. market is partially attributable to a 
global adjustment of international portfolios which started 
outside the U.K. Specifically, 4 of 11 market crises were 
imported from the U.S. market as part of a systemic adjust- 
ment. The June 1992 crisis was a by-product of the Euro- 
pean currency crisis and was imported from a different 
financial market. The results of the out-of-sample test of 
the strategies based on the BSEYD are variable across the 
five countries. Almost all strategies performed better than 
the stock market, as is shown in the second section of this 
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article and the appendix. The results in the U.S. and Japan 
were the best. All five countries experienced the situation 
in which not all corrections were predicted. 

The co-movement of the BSEYD and the respec- 
tive stock market demonstrates that some corrections are 
not detectable. The methodology of the strategies rec- 
ommends moving out of the stock market when the cur- 
rent value of the BSEYD is extremely high compared to 
its past values. Hence, a rise of the BSEYD prior to the 
occurrence of corrections is necessary to yield a successful 
prediction of these corrections. This behavior was observ- 
able for some of the corrections in the various countries. 
The stock market did not always fall after a sharp rise in 
the indicator, however, and other corrections occurred 
when the indicator was not at historically excessive levels. 
These corrections were not caused by disequilibrium in 
the yield on long-term government bonds and the EP 
ratio of the stock market. Only one of the corrections 
which occurred during 1995-2005 was predicted by the 
strategies. Two strategies for the U.K. gave correct signals 
for the correction starting in January 2000. 

The results demonstrate a relationship between the 
yield on long-term government bonds and the EP ratio 
of the stock market, at least for the extreme values of the 
spread between these variables. This relationship was 
proven to be exploitable for outperforming the stock 
market. Because the pattern is observable over a long 
period of time in several countries and is valid for most 
strategies, it is generally applicable and not data mining. 

If the strategies detect a negative return on the stock 
market for certain periods over a long time, this leads to 
the conclusion that, on average, a negative return on the 
stock market could be expected whenever the indicator 
moves into the danger zone. 

The assertion that trading profits suggest a semi- 
strong efficient markets hypothesis violation is true for all 
countries. Although the net mean returns on the strate- 
gies were not always clearly higher than those of the stock 
market, the standard deviation of the returns was always 
much below that of the stock market. So, risk-adjusted 
profits net of transaction costs were always positive with 
higher Sharpe ratios. The assertion that negative expected 
returns suggest even stronger market inefficiency cannot 
be generally supported. Negative stock market returns 
for out-of-the-market periods are observable for the U.S. 
and Japan, and only partly for Germany and the U.K. 
These returns are not significantly negative due to the 
high volatility of returns in these periods. 
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ENDOGENOUS INSTABILITY, RISK 
PREMIUMS, AND THE NOMINAL- VERSUS 
REAL-VARIABLE ISSUE 


Bond-stock yield differentials provide reliable fore- 
casting signals in conjunction with other financial vari- 
ables such as the level of interest rates, interest rate 
differentials, and equity market cycles. Campbell and 
Vuolteenaho [2004] criticized the Fed model, but admit 
that it has been “remarkably successful as a behavioural 
description of stock prices, and its descriptions of equity 
market equilibrium is superficially attractive.” Their crit- 
icism is inflation-based since the major factor determining 
the bond yield is the expected long-term inflation rate. 
Declining inflation raises bond prices causing yields to 
fall because the fixed nominal-value coupons are worth 
less. They then argue, correctly, that there is no reason 
stocks should have lower yields because stocks are claims 
on corporate profits which are not fixed in nominal terms. 
Therefore, profits can move in line with inflation. 

Campbell and Vuolteenaho rejected the two fol- 
lowing potential explanations: 


1. Inflation is bad for the real economy and corporate 
profits so that rising inflation lowers expected future 
profits and stock prices. 

2. Inflation increases uncertainty and, therefore, stock 
prices decline. 


Econometric studies since the 1920s have shown, 
however, that corporate dividends are not inversely related 
to inflation and bond yields. Also, the evidence does not 
support either of the two preceding explanations. 

But Campbell and Vuolteenaho are swayed by the 
Modigliani and Cohn [1979] argument that inflation con- 
fuses investors who use nominal-based yields to discount 
real corporate profits. During high-inflation periods, stocks 
are underpriced, but declining inflation eliminates this 
confusion. Campbell and Vuolteenaho observed that 
measures of the equity risk premium, based on the cross- 
sectional pricing of riskier stocks relative to safer stocks 
or on the relative volatility of stocks and bonds, are essen- 
tially unrelated to past inflation and bond yields. We agree 
with the Campbell and Vuolteenaho criticism and their 
attempt to argue why the bond~stock measures might 
work. Supporting this is the observation that investors are 
easily confused, relying on few, and sometimes changing, 
key factors and ignoring other relevant economic factors. 


SPRING 2008 


At the heart of the Fed model, Campbell and 
Vuolteenaho [2004] criticize the comparability of the 
Treasury rate with the S&P 500 earnings yield, the first 
being a nominal return strictly related to the expected 
rate of inflation and the second an aggregate claim to cor- 
porate profits. Bond investors are aware that bond prices 
vary in order to compensate for inflation expectations, 
which leads to nominal yields moving into line with the 
prevailing term structure of interest rates. In contrast, cor- 
porate earnings are not expected to be constant in nom- 
inal terms. The idea that investors compare the 10-year 
nominal Treasury rate with the S&P 500 real price yield 
is thus ill-founded and unsustainable according to this 
explanation. 

Consigli et al. [2006] tested the hypothesis that over 
the 1970-2005 period corrections in the U.S. market 
were consistent with the postulated Fed model. They 
found evidence ofa structural positive risk premium, thus 
rejecting the claimed convergence of equity and bond 
returns after a sequence of corrections. This suggests the 
inclusion of the premium in a stochastic model of equity 
returns. The inclusion of a risk premium indirectly 
addresses the inflation expectations issue and focuses on 
the possible deviation of the equity yield from a theoret- 
ically sound market yield that, ceteris paribus, should 
determine current stock market values. The comparison 
is thus only on the appropriate discount factor of future 
earnings, for the current risk-free rate. The presence of 
a risk premium in the market can in this framework be 
directly implied from market data. Over the 1985-2005 
period, the S&P 500 Index and its theoretical value, as 
given in Equation (1), performed as shown in Exhibit 9. 

The departure of the 10-year rate from the equity 
yield is captured by the valuation measure which quan- 
tifies the percentage difference between the actual and 
theoretical benchmark values as illustrated in the right 
y-axis in Exhibit 9. Consider the correction periods in 
Exhibit 6. The 1987 crisis, the 1997 limited correction, 
and the 2000 and 2002 market corrections were associ- 
ated with rising stock market prices and a decreasing the- 
oretical S&P 500 fair value as anticipated by the BSEYD. 
Other corrections, such as the July 1998 and the May-June 
2006 crises, occurred while the indicator was not in the 
danger zone. 

Following this statistical evidence, the amplitude 
and frequency of the corrections generated by the market 
can be studied as a function of the yield differential and 
other variables driving the misvaluation process. Consigli 
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EXHIBIT 9 


Actual and Theoretical S&P 500 and BSYR Process Minus 1 


S&P 500 Valuation Model 


bt ptt ttt ttt 
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Source: Consigli et al. [2006] 


et al. [2006] suggested a model extension of the basic Fed 
model assumptions in order to accommodate the various 
exceptions we have discussed. They proposed an asset 
pricing model for equity returns in which market insta- 
bility phenomena are modeled as a Cox process whose 
intensity and market impact depend endogenously on the 
BSYR and a market-specific risk premium. The eco- 


EXHIBIT 10 


Bond-Stock Earnings Yield Differentials and Market Corrections (<-2.5%) 


in the U.S. Market, 1985-2005 


S&P excesses and Yield differential 
1985-2005 


y = -0.0148x - 4E-05 
R" = 0,6941 
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nomic rationale of this approach is 
that even if the BSYR cannot alone 
be taken as a statistically robust 
leading indicator of prolonged 


market instability, the ratio provides 
Taea an appropriate over- or underval- 
| 140% uation measure as an endogenous 


j 120% state variable of an asset pricing 
model with discontinuous returns. 
The random dependence of the 
correction process from the yield 
differential is also supported by the 
evidence in Exhibit 9. This displays 
the estimated linear relationship 
between the BSEYD and the set 
of market corrections—adjust- 
ments beyond a predefined -2.5% 
threshold— in the U.S. during the 
1985-2005 period. The relation- 
ship is negative and statistically sig- 
nificant. The definition of an endogenous source of 
instability in the asset return model generalizes previous 
attempts to model equity returns as a Gaussian-Poisson 
process with constant jump intensity (Consigli et al. 
[2002]). This yields a model whose parameters can be 
estimated via maximum likelihood and fitted to market 
data as shown in Exhibits 9 and 10. 

The theoretical market price 
is generated by a discount rate that 
depends on the risk-free rate and 
a risk premium. The risk premium 
depends on factors unaccounted 
for in the original Fed model and, 
jointly with the BSEYD, deter- 
mines the amplitude of forth- 
coming market adjustments. 
Investors modify their strategic 
portfolio allocation from bonds to 
equity, and vice versa, analyzing 
the sustainability of the current 
deviation of equity prices from 
their theoretical value and then 
choosing how much to invest in 
bonds and equities. Expectations 
are updated depending on the joint 
movements of the 10-year rate plus 
a risk premium and aggregate 
earnings, assumed to be fully 


100% 
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+40% 
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incorporated in prices. The bond/stock/cash strategic 
portfolio selection problem can then be formulated and 
solved as a stochastic control problem. Once an exces- 
sive deviation of the current stock market value from its 
theoretical value is observed, the ratio between actual 
and theoretical stock price will depart from 1.0 and the 
ensuing adjustment can be assumed to be a random func- 
tion of the BSYR. The suggested approach is consistent 
with the dynamic version of the Fed model proposed by 
Koivu, Pennanen, and Ziemba [2005] and extends the 
evidence of a co-integration relationship between bond 
and equity yields in the U.S., Japan, and Germany. See also 
Durré and Giot [2005] for more on the co-integration of 
the Fed model in various countries. 


CONCLUSION 


The bond-stock model has been shown to add value 
to a buy and hold strategy in various U.S. and foreign 
markets and to have the ability to predict corrections to 
some extent. Thus, the model provides evidence that the 
equity risk premium is not constant and partially pre- 
dictable. Predictive ability is the best in the U.S. and in 
Japan, and in earlier time periods. Nevertheless, the model 
provided good results in the period 1980-2005. Campbell 
and Thompson [2005] investigated whether certain strate- 
gies out-predict the historical average of the equity risk pre- 
mium. The evidence offered by Berge and Ziemba [2006] 
is that bond-stock measures are better predictors than his- 
torical buy and hold averages. The Consigli et al. [2006] 


APPENDIX 
All exhibits are from Berge and Ziemba [2006]. 


EXHIBIT Al 
Stock Market Corrections (>10%) in the U.S., 1975-2005 


model incorporates the bond-stock measure in a sto- 
chastic asset-pricing model valuation. This allows direct 
market back testing in a simulation framework and leads 
to a stochastic control problem for portfolio selection (i.e, 
tilting using the bond—stock measure adds value). High 
current PE ratios relative to historical PE ratios have been 
shown to predict corrections. See Aron [1981]; Bleiberg 
[1989]; Campbell and Ammer [1993]; Campbell and 
Shiller [1998, 2005]; Fama [1970, 1991, 1998]; French and 
Poterba [1991]; Polk, Thompson, and Vuolteenaho 
[2006]; Shiller [2000]; Siegel [2002], and Ziemba and 
Schwartz [1991]. However, the bond-stock model that 
compares PE ratios to interest rates seems to have better 
predictive ability (Ziemba and Schwartz [1991]; Rolph and 
Shen [1999]; Wong and Chew [1999]; Yardeni [2003]; 
Ziemba [2003], and Berge and Ziemba [2006J). The crit- 
icisms of the model by Campbell and Vuolteenaho [2004] 
and others are valid. The Modigliani and Cohn [1979] 
inflation confusion argument is one way to theoretically 
justify the model. 

The research on the bond-stock model surveyed in 
this article contributes to the research on the role the model 
plays in portfolio strategies and clarifies the model's poten- 
tial as a market crisis signal. The analysis also confirms the 
partial predictability of equity returns and the need to take 
agents’ risk aversion into account when assessing market 
mean-reverting behavior. Furthermore, the analysis pro- 
vides a basis to statistical models of equity returns which 
have an endogenous source of instability, thus leading to 
an approach to strategic asset allocation over time. 


Although the duration of the correction starting in January 1977 is longer than one year, it is considered a correction because the stock market 
dropped 11% within 10 months between January and October 1977. The same is true for the correction starting in April 2000; the stock market 


dropped 23.9% within the following 12 months. 


Correction Number Start Date Duration (Months) _ Decline (%) 
1 July 1975 3 -11.8 
2 January 1977 14 -15.2 
3 April 1981 6 -11.6 
4 December 1981 8 -11.1 
5 September 1987 3 -29.4 
6 June 1990 . 5 -14.1 
7 July 1998 2 -14.7 
8 April 2000 18 -31.2 
9 April 2002 6 -29.1 
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EXHIBIT A2 
Evaluation of the Performance of the Strategies for the U.S. 


The mean excess return is the average monthly excess return of the strategy over the stock market. Terminal values refer to the gross performance 
of U.S.$100 invested using the strategy signals. 


Overall Performance of the Strategies/Stock Market 


Number of Months Mean Log Standard Sharpe Mean Excess Terminal 
Strategy in the Stack Market Return Deviation Ratio Retarn Value (US$) 
1 319 (85.75%) 0.01194 0.03933 0.17814 0.00162 8,480.41 
2 300 (80.65%) 0.01255 0.03737 0.20378 0.00222 10,635.11 
3 325 (87.37%) 0.01222 0.03936 0.18517 0.00190 9,420.42 
4 314 (84.41%) 0.01246 0.03835 0.19632 0.00214 10,299.03 
Stock Market (1975 -2005) 372 (100%) 9.01032 0.04347 0.12402 4,649.75 _ 
5 273 (87.5%) 0.01254 0.03887 0.19773 0.00224 5,009.00 
6 266 (85.26%) 0.01300 0.03835 0.21236 0.00270 5,781.16 
7 260 (83.33%) 0.01215 0.03831 0.19039 0.00185 4,432.10 
8 254 (81.41%) 0.01225 0.03785 0.19529 0.00195 4,569.44 
Stock Market (1980-2005) 312 (100%) 0.01030 0.04389 0.12405 2,490.17 


EXHIBIT A3 
Stock Market Corrections (>10%) in Germany, 1975-2005 
Although the duration of the correction starting in October 1978 is longer than one year, it is considered a correction because the stock market 


dropped 12.5% within 9 months between October 1978 and June 1979. The same is true for the correction starting in March 2000; the stock 
market dropped 20.7% within the following 12 months. 


Correction Number Start Date Duration (Months) Decline (%) 


1 April 1976 7 -13.7 
2 October 1978 18 -15.9 
3 February 1984 6 -11.2 
4 May 1986 3 -16.1 
5 December 1986 3 -17.9 
6 September 1987 5 -40.7 
7 August 1990 2 -31.0 
8 June 1992 4 -18.7 
9 May 1994 il -13.6 
10 August 1997 3 -15.4 
i August 1998 2 -22.4 
12 March 2000 19 -44.0 
13 April 2002 12 -53.9 


EXHIBIT A4 
Evaluation of the Overall Performance of the Strategies for Germany 


The mean excess return is the average monthly excess return of the strategy over the stock market. Terminal valucs refer to the gross performance 
of €100 invested using the strategy signals. 


s Overall Performance of the Strategies/Stock Market 
Number of Months Mean Log Standard Sharpe Mean Excess Terminal 


Strategy in the Stock Market Return Deviation Ratio Return Value (€) 

1 302 (81.18%) 0.00881 0.05150 0.09157 0.00068 2,652.80 

2 265 (71.24%) 0.00789 0.04986 0.07613 -0.00024 1,884.15 

3 322 (86.56%) 0.00934 0.05455 0.09614 0.00121 3,228.95 

4 312 (83.87%) 0.00922 0.05425 0.09446 0.00109 3,088.99 

Stock Market (1975-2005) 372 (100%) 0.00813 0.05864 0.06886 2,061.27 
5 279 (89.42%) 0.00963 0.05836 0.09405 0.00123 2,014.62 

6 241 (77.24%) 0.00910 0.05651 0.08787 0.00071 1,711.54 

i 259 (83.01%) 0.01032 0.05736 0.10785 0.00193 2,504.27 

8 253 (81.09%) 0.01010 0.05719 0.10431 0.00170 2,337.64 

Stock Market (1980-2005) 312 (100%) 0.00840 0.06201 0.06870 1,373.26 
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EXHIBIT A5 
Stock Market Corrections (>10%) in Canada, 1975-2005 


Although the duration of the correction starting in September 2000 is longer than one year, it is considered a correction because the stock market 
dropped 36.4% within 12 months between September 2000 and August 2001. 


Correction Number Start Date Duration (Months) Decline (%) 


1 July 1975 4 -11.0 
2 June 1976 6 -14.6 
3 March 1980 1 -18.7 
4 July 1981 12 -39.4 
5 December 1983 8 -13.8 
6 September 1987 3 -25.6 
7 January 1990 4 -14.5 
8 August 1990 3 -10.9 
9 February 1994 5 -11.1 
10 June 1998 3 -25.0 
11 September 2000 13 -41.2 
12 April 2002 6 -21.3 


EXHIBIT A6 
Evaluation of the Overall Performance of the Strategies for Canada 


The mean excess return is the average monthly excess return of the strategy over the stock market. Terminal values refer to the gross performance 
of C$100 invested using the strategy signals, 


Overall Performance of the Strategies/Stock Market 
Number of Months Mean Log Standard Sharpe Mean Excess Terminal 


Strategy in the Stock Market Return Deviation Ratio Return Value (C$) 
1 295 (79.3%) 0.00954 0.04243 0.08053 -0.00020 3,479.37 
2 276 (74.19%) 0.00937 0.04110 0.07906 -0.00036 3,269.07 
3 313 (84.14%) 0.00963 0.04384 0.07986 -0.00011 3,590.86 
4 304 (81.72%) 0.00997 0.04301 0.08933 0.00023 4,075.27 
Stock Market (1975-2005) 372 (100%) 0.00974 0.04925 0.07337 3,743.49 
5 273 (87.5%) 0.00937 0.04336 0.07839 0.00094 1,859.80 
6 256 (82.05%) 0.00951 0.04146 0.08535 0.00108 1,942.95 
7 265 (84.94%) 0.00900 0.04235 0.07149 0.00057 1,656.33 
8 254 (81.41%) 0.00890 0.04188 0.06997 0.00047 1,607.12 
Stock Market (1980-2005) 312 (100%) 0.00843 0.04941 0.04977 1,387.23 


EXHIBIT A7 
Stock Market Corrections (>10%) in the U.K., 1975-2005 


Although the duration of the correction starting in September 2000 is longer than one year, it is considered a correction because the stock market 
dropped 14.7% within 12 months between September 2000 and August 2001. 


Correction Number Start Date Duration (Months) Decline (%) 
0 


1 June 1975 2 -13, 

2 February 1976 9 -25.6 
3 October 1977 5 -14.0 
4 May 1979 3 -14.6 
5 September 1981 1 -16.1 
6 October 1987 2 -33.4 
7 September 1989 2 -10.4 
8 January 1990 4 -12.3 
9 July 1990 3 -16.0 
10 June 1992 3 -14.9 
14 February 1994 5 -15.1 
12 May 1998 5 -14.3 
13 January 2000 2 -10.2 
14 September 2000 13 -17.9 
15 April 2002 10 -31.6 
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EXHIBIT A8 
Evaluation of the Overall Performance of the Strategies for the U.K. 


The mean excess return is the average monthly excess return of the strategy over the stock market. Terminal values refer to the gross performance 
of £100 invested using the strategy signals. 


Overall Performance of the Strategies/Stock Market 
Number of Months Mean Log = Standard Sharpe Mean Excess Terminal 


Strategy in the Stock Market Return Deviation Ratio Return Value (£) 

1 329 (88.44%) 0.01426 0.05020 0.14825 0.00102 20,163.11 

2 314 (84.41%) 0.01363 0.04947 0.13757 0.00039 15,911.90 

3 362 (97.31%) 0.01346 0.05309 0.12504 0.00022 14,948.86 

4 354 (95.16%) 0.01323 0.05279 0.12136 -0.00001 13,714.36 

Stock Market (1975-2005) 372 (100%) 0.01324 0.05598 0.11470 13,787.40 
5 297 (95.19%) 0.01204 0.04420 0.12277 9.00101 4,275.28 

6 289 (92.63%) 0.01163 0.04363 0.11495 0.00060 3,760.35 

7 300 (96.15%) 0.01218 0.04419 0.12616 0.00116 4,477.37 

8 293 (93.91%) 0.01196 0.04358 0.12266 0.00093 4,168.13 

Stock Market (1980-2005) 312 (100%) 0.01103 0.04814 0.09179 3,121.47 


EXHIBIT A9 
Stock Market Corrections (>10%) in Japan, May 1985-December 2005 
Although the duration of the correction starting in January 1990 is longer than one year, it is considered a correction because the stock market 


dropped 40% within 12 months between January and December 1990. The same is true for Corrections 4 (23.5% decline between June 1994 
and May 1995), 6 (15.3% decline between August 1997 and July 1998), and 7 (20.1% decline between April 2000 and March 2001). 


Correction Number Start Date Duration (Months) Decline (%) 


1 June 1987 7 -20.3 
2 January 1990 31 -56.2 
3 September 1993 3 -17.6 
4 June 1994 13 -27.0 
5 July 1996 7 -17.5 
6 August 1997 15 -31.1 
7 April 2000 37 -51.8 


EXHIBIT A10 


Evaluation of the Overall Performance of the Strategies for Japan 


The mean excess return is the average monthly excess return of the strategy over the stock market. Terminal values refer to the gross performance 
of ¥100 invested using the strategy signals. 


Overall Performance of the Strategies/Stock Market 
Number of Months Mean Log Standard Sharpe Mean Excess Terminal 


Strategy in the Stock Market Return Deviation Ratio Return Value (¥) 
1 222 (89.52%) 0.00612 0.05049 0.08427 0.00305 455.72 
2 214 (86.29%) 0.00627 0.04942 0.08921 0.00320 473.45 
3 229 (92.34%) 0.00648 0.05097 0.09059 0.00341 498.62 
4 226 (91.13%) 0.00640 0.05077 0.08942 0.00334 489.16 
Stock Market (5/85-12/05) 248 (100%) 0.00307 0.05778 0.02085 213.88 
5 172 (91.49%) 0.00120 0.05050 -0.00157 0.00168 125.35 
6 169 (89.89%) 0.00128 0.04994 -0.00008 0.00176 127.13 
7 174 (92.55%) 0.00177 0.05130 0.00960 0.00225 139.56 
8 172 (91.49%) 0.00120 0.05050 -0.00157 0.00168 125.35 
Stock Market (5/90-12/05) 188 (100%) -0.00048 0.05745 -0.03062 7 91.39 
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Chapter 10 + Efficiency of Betting Markets 
Abstract 


This is a survey of efficiency in racing, sports, and lottery markets. The win market 
is efficient but exhibits a favorite-longshot bias. The place and show markets, which 
involve more possible finishes, allow inefficiencies by using the win probabilities. These 
biases are discussed for U.S. and Hong Kong markets. The Kelly capital growth criterion 
is useful to implement a model to exploit these inefficiencies. Exotic markets involve 
even more complex sets of bets. Finally, possible inefficiencies in cross-track betting 
are discussed. Football and basketball betting markets are largely efficient. Lotteries 
provide interesting markets with one way to potentially exploit them being the use of 
unpopular numbers. 


Keywords: market efficiency, horserace betting, sports betting, lottery wagers 


1. INTRODUCTION 


Financial economists have long been interested in the efficiency of financial markets. 
Kendall (1953) examined the behavior of industrial share prices and spot prices for 
cotton and wheat. By analyzing serial correlations, prices appeared to follow a random 
walk. In the 1960s the focus was on defining efficiency and performing tests for a range 
of efficiency notions. Roberts (1967) defined weak, semi-strong, and strong form effi- 
ciencies as holding when stock market prices reflect all price information, all publicly 
available information, and all information, respectively. Most financial markets have 
been shown to be efficient in the weak form. The evidence for semi-strong is mixed; 
see Sung and Johnson (2008) for a survey of the racing literature. The strong form is 
largely inefficient. See Fama (1970) for a survey of this work in financial markets. The 
exceptions, termed anomalies, include seasonal patterns such as the small firm January 
effect, turn-of-the-month and year effects, holiday effect, day of week, time of day, the 
Value Line enigma, and cross-sectional regularities that apply to stocks with low price 
to earnings ratios or with earnings surprises, and so forth. See the surveys by Hawaw- 
ini and Keim (1995, 2000), Ziemba (1994), and Keim and Ziemba (2000) for more 
details. 

Fama (1991) updated his earlier survey. Tests for return predictability focus on fore- 
casting returns using variables such as interest rates and dividend yields. Event studies 
formalize the semi-strong form idea by testing whether or not there are adjustments 
of prices to specific public announcements. Finally, the strong form concept is studied 
through tests for private information. The evidence is that future returns are predictable 
from past returns, dividend yields, and term structure variables. On the face of it, this is 
a violation of weak-form efficiency. But, as suggested by Roll (1977), since every test 
of efficiency must be a joint one with a maintained equilibrium hypothesis of price 
formation (e.g., the capital asset or arbitrage pricing models), this violation is 
confounded by the joint hypothesis problem of whether there is a rational variation 
over time in expected returns or systematic deviations from fundamental value. While 
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arguments can be made that increased returns may be occurring because of increased 
tisk, which is difficult or impossible to measure accurately, there is very strong evidence 
that most or all of the gains in securities markets have occurred during the seasonally 
anomalous periods. Ritter and Chopra (1989) and Cadsby (1992) show, for example, 
that the only periods where risk as measured by the capital asset pricing model is 
rewarded with equity returns are precisely at the anomalous periods such as the day 
before holidays, the turn of the month, in the first two weeks of January for small stocks, 
and so on. Ariel (1987), Lakonishok and Smidt (1988), and Hensel et al. (2000) showed 
that essentially all the stock market’s gains during the twentieth century in the U.S. 
occurred in the first half of the month. Event studies are more straightforward and less 
controversial, since they are able to provide more clear-cut evidence of the effect of new 
information. Regarding strong-form tests, there is considerable evidence that corporate 
insiders have private information that is not fully reflected in current prices. 

Sports and betting markets are well suited for testing market efficiency and bettor 
rationality. This is because vast amounts of data are available, in the form of prices (for 
devising technical systems) and other information (for devising fundamental systems), 
and each bet has a specified termination point when its final asset value is determined. 
For rationality tests, markets with this latter property offer an advantage over markets, 
like securities markets, where the current value depends upon future events and current 
expectations of future values. Also, some wagering markets have characteristics that 
reduce the problematic nature of the aforementioned joint hypothesis test. For instance, 
Dana and Knetter (1994) note that point spread bets on National Football League games 
all have identical risk and return characteristics, as well as similar horizons. This allows 
a test of efficiency without specifying the bettors’ utility functions. 

The special properties of sports betting and lotteries might lead one to speculate that 
they are even more efficient than financial markets. However, there is another aspect 
to these markets that confounds the notion of rationality: for them to be offered, the 
average bettor must lose. Indeed, given the transaction costs involved in these markets 
(e.g., about 13-30% for horse racing and about 50% for lotteries), the average losses 
are large. This has not stopped the search for profitable wagering systems, though, and 
there are some notable successes. For example, Thorp (1961, 1962) demonstrated that 
card-counters can win playing blackjack. This survey of research on horse racing, sports 
betting on football and basketball, and lotteries reports numerous studies of efficiency 
in these markets. Several profitable systems are also described, though.' The continued 
success of these winning systems tends to be related to some complicating factor in its 
development or execution. For instance, the system may involve short odds and complex 
probability estimation (e.g., place and show wagering at the racetrack), it may rely on 
syndicates of bettors (e.g., cross-track horse race betting), it could require extremely 
long time horizons (e.g., lotteries), or extensive data collection and statistical work (e.g., 
fundamental handicapping systems for horse racing). The winning systems described 
are, of course, just a subset of the winning systems used in practice. The incentives to 


'Beyond the academic work surveyed here, evidence abounds of individuals who have successfully beaten 
the odds. See, for example, Akst (1989), Benter (1994, 2001), and Beyer (1978). 
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disclose details of a winning system may not be sufficient in some cases given that such 
an action typically reduces the system’s profitability as others employ it. Finally, we also 
discuss optimal betting strategies for exploiting inefficiencies when they are present. 

Sports betting and lotteries involve substantial transactions costs. Because it directly 
affects prices, the take—what the gambling establishment keeps for its operation—is 
properly accounted for in all of the analyses discussed in this survey. Another cost in 
these markets is for information (e.g., tip sheets at the racetrack). Costly information 
requires a redefinition of efficiency, one where prices are said to reflect information to 
the point where the cost of additional information just equals the benefit of acting on 
that information. But because they are difficult to measure, this survey ignores informa- 
tion costs (and other transaction costs beyond the take). Thus, our general findings of 
efficiency gain further support after the introduction of these other costs. In racing, the 
advent of rebates has driven the take from 13% to 30% for various bets to about 10% 
for large bettors. Just like shopping at a wholesale market for food, large bettors get a 
quantity discount. The rebaters and the large bettors share a discount given by the track 
for the signals (outcomes). The net effect of this is that when these discount bets are 
blended with the rest of the bets, the effective track take for the small bettor is more 
than the posted take. Rebates at a lower level are available to small bettors. 

Figure 1 provides a taxonomy for the types of games that we discuss. Games are 
classified by: (1) whether the chance of winning is purely luck or can be influenced with 
skill; and (2) whether the payoff upon winning is predetermined or can be improved with 
skill. Luck-luck games allow no possibility of discovering a profitable strategy, and so 
as markets they are trivially efficient. On the other hand, there need not be a guarantee 
of efficiency for luck-skill games such as lotto (where we discuss a strategy of betting 
unpopular numbers, which does not affect the probability of winning but does affect the 
payoff upon winning) and skill-luck games (which are relatively uncommon). Blackjack, 
a skill-skill game, allows a profitable strategy. For horse racing, another skill-skill game, 
we review findings that certain forms of wagers are efficient while others are not. 

Unlike most financial securities markets, the average lottery and sports betting par- 
ticipant must lose. We may, indeed, choose to differentiate gambling and investing by 


CHANCE OF WINNING 


COMPLETE LUCK SKILL INVOLVED 


Example: Pay $1 for a chance to 
pick all winners of hockey games 


A | COMPLETE Scratch lottery games with on a particular day. From those 
fixed payment who have all correct selections, 
one name is randomly drawn and 
awarded $100,000, 


Lotto, such as 6/49, with Spone pool games 
i orse racing 
some or all pari-mutuel 


V Blackjack 
INVOLVED payoffs Sports betting 


FIGURE i Taxonomy of games. Source: Adapted from Ziemba et al. (1986). 
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their expected returns, using the terms gambling when the expected profit is negative 
and investing when the expected profit, including all transactions costs and risk adjust- 
ments, is positive. Obviously, a willingness to assume risk in the face of negative 
expected returns is inconsistent with the traditional assumptions that (1) individuals 
maximize the expected utility of wealth, and (2) utility functions are concave, that is, 
risk aversion. Instead of the second assumption, Friedman and Savage (1948) assumed 
a utility function that is convex in a neighborhood of the individuals present wealth but 
concave over higher and lower wealths. Given their different payoff distributions, simul- 
taneously purchasing lottery tickets and insurance can be consistent with this form of 
a utility function. Markowitz (1952) offered a functional form that eliminates some 
behavior admitted by Friedman and Savage’s form that is not generally observed. He 
also pointed to the possibility of a utility function that recognizes the fun of gambling. 
Conlisk (1993) formalized this notion and found his model to be largely consistent with 
actual risk-taking behavior. 

Other surveys on this topic include Clotfelter and Cook (1991) and three chapters 
in Part VIII of this volume. On lotteries, Thaler and Ziemba (1988) on horse racing 
and lotteries, Hausch and Ziemba (1992) and Vaughan Williams (2003, 2005) on sports 
betting and lotteries, and Hausch et al. (1994a, 2008) on horse racing. Lane and Ziemba 
(2004, 2008) study hedging strategies for jai alai, a sport not considered in this survey. 


2. EXTENT OF GAMBLING IN THE U.S. 


The level of gambling in the U.S. is enormous. Welles (1989) estimates $240 billion 
as the total annual wagering by Americans, both legal and illegal; an amount that is 
growing at 10% per year. By 1992, the estimate is $329.9 billion including $29.9 billion 
of gross revenue for governments and gaming institutions, more than six times annual 
movie ticket sales in the U.S. (Kleinfield, 1993). Table 1 presents gross gambling rev- 
enues from various gambling from 1985 to 1992. Pari-mutuel betting revenues have 
grown slowest, while revenues from Indian gaming have dramatically risen. 

Lotteries accounted for about one-third of gross gambling revenues over this period; 
see Table 1. On average, the lottery returns are only about 54% of sales. For pari-mutuel 
wagering, the average payback is about 80%, with the state government collecting usu- 
ally only a few percent. That tax rate can vary across the tracks within a state, though. 
For example, the three major thoroughbred tracks in south Florida have long feuded 
over the prime winter racing season when tourism is at a peak. Florida passed a bill 
that in 1994 granted the prime season to Gulfstream Park with a 3.0% state tax. Hialeah 
Park’s season was taxed by the state at 1.15%, while Calder’s tax was 2.4%. Thalheimer 
and Ali (1995, 2008) found that demand for racetrack betting is price elastic, which sug- 
gests that track-take revenue should increase with a reduction in the track-take fraction. 
By 1990, 34 states were offering pari-mutuel betting on thoroughbred or standardbred 
horses, or on greyhounds. Thalheimer and Ali (1995) also found that the presence of a 
state lottery lowered both attendance and the average bettor’s wager. See Table 2. 
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TABLE 1 Gross Gambling Revenues from Various Gambling Activities ($billions) 


Year Lottery Casino? Pari-mutuel? Bingo Indian gaming Total® 
1985 5.2 5.5 3:1 0.91 0.09 15.4 
1986 6.3 hr 3.2 0.94 0.10 16.9 
1987 6.6 6.4 3.3 0.90 0.11 18.3 
1988 8.4 Ta 3.5 0.88 0.10 21.5 
1989 9.6 TA 3.6 0.95 0.30 24.0 
1990 10.3 8.7 3.7 1.0 0.48 26.2 
1991 10.2 9.0 3.7 1.1 0.72 26.7 
1992 11.5 10.1 3.7 1.1 1.5 29.9 


*Does not include Indian gaming. 

Includes horse and greyhound racing, jai alai, and off-track betting. 

“Because of other forms of gambling, columns sum to less than the total column. 
Source: Kleinfield (1993). 


TABLE 2 State Lottery Startup Dates, FY 2002 Sales, Sales Per Capita 
FY 2004, and Rank 


State Startup Sales ($ million) Per capita sales Rank 
Arizona 1981 294.82 64.75 38 
California 1985 2,915.90 82.57 29 
Colorado 1983 407.97 87.71 28 
Connecticut 1972 907.9 259,68 9 
Delaware’ 1975 674.01 777.57 3 
Florida 1988 2,330.36 178.56 18 
Georgia 1993 2,449.36 309.66 7 
Idaho 1989 92.67 79.21 31 
Illinois 1974 1,590.15 134.78 24 
Indiana 1989 626.31 118.17 26 
Iowa 1985 181.22 71.20 37 
Kansas 1987 190.08 82.12 30 
Kentucky 1989 638.72 175.52 19 
Louisiana 1991 311.62 75.50 35 
Maine 1974 157.9 141.54 22 
Maryland 1973 1,306.55 252.10 11 
Massachusetts 1972 4,213.22 682.60 5 
Michigan 1972 1,688.04 194.88 14 


Minnesota 1990 377.36 76.12 34 
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TABLE 2 (continued) 


Missouri 1986 
Montana 1987 
Nebraska 1993 
New Hampshire 1964 
New Jersey 1970 
New Mexico 1996 
New York 1967 
North Dakota® 

Ohio 1974 
Oregon? 1985 
Pennsylvania 1972 
Rhode Island? 1974 
South Carolina 2002 
South Dakota‘ 1987 
Tennessee? 

Texas 1992 
Vermont 1978 
Virginia 1988 
Washington 1982 
Washington, DC 1982 
West Virginia? 1986 
Wisconsin 1988 
US.4 


"Excludes states with no lottery: Alabama, Alaska, Arkansas, Hawaii, Mississippi, 


585.19 
33.63 
73.91 

212.9 

2,068.52 

133.97 

4,753.62 


1,983.11 
816.94 
1,934.16 
1,171.10 
319.99 
629.96 


2,966.27 
81.99 
1,108.07 
438.61 
211.13 
848.63 
427.57 
42,153.43 


Nevada, North Carolina, Oklahoma, Utah, and Wyoming. 


Includes net VLT sales (cash in less cash out). 
“Includes gross VLT sales (cash in). 

4Sales began January 20, 2004. 

Sales began March 25, 2004. 


Source: Garrett and Wagner (2003) and Tax Foundation (2008). North American Association 
of State and Provincial Lotteries; Census Bureau population data; Tax Foundation calculations. 


137.97 
39.83 
53.17 

183.23 

252.40 
78.64 

304.24 

9.21 

188.21 

249.55 

189.86 

373.04 

227.63 

865.25 
72.82 

130.28 

148.83 

170.30 
78.06 

441.06 

718.81 
87.94 

184.25 
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Sports betting in Nevada was estimated at $1.3 billion in 1988, and illegal action 
nationwide at more than $26 billion (see Akst, 1989), with the largest event being the 
Super Bowi for the championship of American football. Super Bowl wagering in 2008 
was estimated to be over $50 billion. A study in Ontario (see Abbate, 1995) found that 
69% of adults played the lottery in the last month. For other forms of gambling, the 
corresponding percentages were 12% for sports gambling, 9% for card gambling, 7% 
for bingo, 3% for casinos, and 2% for horse racing. 
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3. RACETRACK BETTING MARKETS 


3.1. Introduction to Racetrack Betting 


The racetrack is a market in miniature in which wagering, odds, the outcome of the 
race, and payoffs all occur over a period of 20-30 min, followed by a new market, 
the next race. The number of horses in a race usually ranges from six to 12 in the 
U.S., three to 14 in Hong Kong, six to 18 in Japan, and four to 20 in England. 
A variety of wagers is available to bettors. The simplest is a wager to win, which 
involves picking the horse that finishes first. A place (show) bet involves picking a horse 
that is at least second (third). Exotic wagers are based on two or more horses, such as 
the daily double—picking the winners of two consecutive races; quinellas—picking the 
top two finishers in a race; and exactas—picking the top two finishers in a race in the 
correct order. As in securities markets, the public directly establishes the odds in a pari- 
mutuel betting market; the more the public bets on a horse, the lower its odds and the 
lower its return upon winning. The track pools the public’s wagers and returns a fraction 
Q—the track payback—of that total to the winners. The remainder—the track take—is 
shared by the track owners, the state government, jockey’s fund, race purses, and so on. 
An additional transactions cost is breakage, where the track rounds down returns to the 
nearest nickel or dime on the dollar. 

Weak-form efficiency means that, with access only to the publics odds, no bet allows 
a positive expected profit. A profitable betting system based on the publics odds is pos- 
sible only if weak-form efficiency is violated (see Epstein, 1977). We also consider a 
stricter version—strictly-weak-form efficiency—which requires that with access to the 
publics odds, all bets have equal expected return, namely Q, for a loss of 1 — Q. 

There is an extensive literature on horse race betting. We begin in Section 3.2 with 
studies of win betting; they point to a weak-form efficient win market but one that is not 
strictly-weak-form efficient given a strong and stable betting bias by the public against 
favorites and for longshots. Section 3.3 discusses the evidence against weak-form effi- 
ciency of the place and show markets. It also describes theoretical and implementation 
aspects of wagering schemes that have been devised to exploit this inefficiency. Exotic 
markets and cross-track betting are discussed in Sections 3.4 and 3.5, respectively. This 
survey of horse racing is restricted to pari-mutuel betting, that is, where the payoffs are 
directly determined by the public’s betting. In many countries outside of North America, 
betting is handled by bookies who offer fixed odds. This literature is discussed in 
Hausch et al. (1994a, 2008). 


3.2. Win Market 


Define W; as the public’s wager to win on horse i and W = },, W; as the public’s win 
pool. Then, ignoring breakage, OW /W; is the payoff per dollar wagered on horse i to 
win if and only if horse i wins. Let O; = QW /W; — 1. The odds on horse i are expressed 
as O; to one, or O;/1, and the return on a $1 wager to win on horse i is the original $1 
plus another O; in profit. Similarly, odds of x to y (or x/y) means that a $y wager 
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will pay $x + y, the original wager plus $x. The quantity W;/W can be interpreted as 
the public’s subjective probability that horse i will win. If these subjective probabilities 
are indeed correct, then the gross expected return on a win bet on any horse / is Q, 
that is, strictly-weak-form efficiency. Based on data on over 50,000 races and 300,000 
horses collected from numerous studies, Figure 2 shows the actual expected returns for 
various odds categories. (This extends the analysis of Snyder, 1978.) Expected returns 
are plotted against odds using transaction costs of 0.1533 (=1 — Q) which applies in 
California. The horizontal line indicates the point at which returns are the expected 
0.8467 (=Q). While objective odds (actual outcomes) and the public’s subjective odds 
are highly correlated, actual returns departing from the horizontal line indicates that 
strictly-weak-form efficiency is violated. 

Figure 2 exhibits a clear favorite-longshot bias, with expected returns falling as 
odds lengthen, down to only 13.7 ¢ per dollar wagered at odds exceeding 100/1. This 
bias is strong and stable, having appeared in datasets collected over several decades 
and from tracks of all sizes and throughout the world. (Early studies include Griffith, 
1949; McGlothlin, 1956; and Fabricand, 1965.) In England, a similar graph (see 
Ziemba and Hausch, 1986) pertains even for fixed odds systems where bookies con- 
struct this risk-preference situation to mirror bettor’s desires to overbet longshots and 
underbet favorites. Exceptions to this bias are Busche and Hall (1988) and Busche 
(1994) for Hong Kong and Japan. While clearly violating strictly-weak-form efficiency, 
Figure 2 illustrates that the favorite-longshot bias is insufficient to allow for profitable 
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FIGURE 2 Effective track payback less breakage for various odds levels in California. Filled squares 
show more recent data. Sources: e Ziemba and Hausch (1986) and Æ Ziemba (2004). 
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wagers except on odds below 3/10. In this range, expected returns are positive and 
are about 4-5% for horses with the shortest odds. Such short odds are relatively rare; 
hence, for practical purposes, the win market, while not strictly-weak-form efficient, is 
weak-form efficient. This supports the notion that bettors, at least in aggregate, behave 
rationally in the sense that few profitable wagers remain. The filled squares in Fig- 
ure 2 show more recent data (1997~2004) and in this period extreme favorites did not 
have positive expected value. Also, favorites have lower expected value than in the past. 
However, longshots are similarly overbet as in the data up to 1986. Rosett (1965) finds 
additional support for bettor rationality with evidence that a bettor will not make a sim- 
ple bet over a more complicated one (a parlay) that is preferred in terms of overall 
probability of payoff. 

McGlothlin (1956) studied the favorite-longshot bias over the course of a racing day, 
finding interesting departures from the usual bias for the last two races of the day. The 
penultimate race is frequently the feature race of the day, involving horses with well- 
documented records that have received more public scrutiny than usually occurs, and 
this is particularly so for the favorites. Interestingly, McGlothlin found little underbet- 
ting of these favorites. For the final race, McGlothlin calculated significantly positive 
expected returns for short-odds horses, atypically low returns for mid-range odds horses, 
and small but positive expected returns for the longest-odds horses. An explanation 
offered for these returns is that bettors who are losers—which describes the average 
bettor by the beginning of the last race of the day—use an end-game strategy to recover 
their day’s losses. Such a strategy will tend to overlook favorites because of their low 
payoff, key in on mid-range odds horses because of their adequate returns together with 
a reasonable probability of success, and perhaps even involve underbetting of extreme 
longshots given their low probability of success. Metzger (1985) also found a loose 
pattern of increased underbetting on the favorite as the racing day progressed. 

The favorite-longshot bias may be insufficient to allow for a practical and profit- 
able technical betting scheme, but its existence is still worthy of explanation. Two 
approaches based on rational betting behavior have considered risk-seeking bettors and 
differences of opinion. The first of these generates the bias because risk-seeking bet- 
tors will demand a higher expected return for favorites, which have a lower variance of 
return than do longshots. Weitzman (1965) and Ali (1977) estimated the utility func- 
tion of the representative bettor and showed it to be convex. Quandt (1986) proved that 
locally risk-seeking bettors are a necessary condition for the bias if the bettors have 
homogeneous beliefs, since a loss to the bettors in aggregate follows from a positive 
track take. Ali (1977) offered the second approach for generating the favorite-longshot 
bias. He considered races with two horses and assumed that the risk-neutral bettors hold 
heterogeneous beliefs about the likelihood of each horse winning, and all bettors wager 
an identical amount (say $1) on the horse with the highest expected return on the basis 
of their belief. Ali defined m; as the true probability that horse 1 wins and supposed that 
each bettor’s belief about that probability is a draw from a distribution that has 77 as its 
median value. Suppose that mı > 1/2, that is, horse 1 truly is the favorite. Since each 
bettor wagers $1, a pari-mutuel market belief of m, would require that a fraction 7 of 
the bettors have beliefs exceeding mı. This would be unexpected, though, since m; is the 
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median belief. The pari-mutuel market payoff must be lower on average to attract the 
additional bettors needed to sustain that payoff. The pari-mutuel result is that horse 1 
will tend to be the betting favorite but it will be underbet relative to its true probability. 
The reverse occurs with the other horse, the longshot, resulting in the favorite-longshot 
bias.? Blough’s (1994) model incorporated both non-linear utility and heterogeneous 
beliefs, and with a restriction on the beliefs of bettors he extended Ali’s analysis beyond 
two horses to an arbitrary number. He also developed an econometric test to distinguish 
between these two causes of the favorite-longshot bias but, as his data did not exhibit 
the bias, neither cause appeared to be present. 

Behavioral explanations have also been offered for the favorite-longshot bias. 
Thaler and Ziemba (1988) mentioned several, including: (1) overestimation of small 
probability events; (2) exaggerating a longshot’s contribution to expected utility (from 
prospect theory; see Kahneman and Tversky, 1979); and (3) bragging rights that are 
available to those who win with longshots but not to those winning with favorites. 
They also suggest that mental accounting (see Kahneman and Tversky, 1984) may be a 
promising way of modeling racetrack betting. It allows the bettor to be risk seeking in 
one domain while risk averse in another. The notion is that bettors have mental accounts 
and act as if the funds in these accounts are not fungible. Suppose bettors A and B have 
the same betting behavior. If A just lost his or her $100 on a horse race while B just read 
in the newspaper’s financial section of a $100 loss in his or her stock portfolio, then 
mental accounting predicts different subsequent wagering behavior because B’s loss is 
unrelated to racetrack betting. 

Figure 2 is based on final odds—the odds on which win payoffs are determined. 
The public’s odds can vary considerably over the betting period, though. Such price 
movements are of interest if they themselves reveal information. Asch et al. (1982) 
concluded that bettors late in the betting period do achieve higher returns than early 
bettors but were unable to devise a profitable strategy exploiting this. Asch et al. (1984, 
1986) and Asch and Quandt (1986), using new data, showed that wagering on the horse 
with the highest win probability, when that probability is partially based on the public’s 
marginal odds (i.e., odds calculated using only wagering late in the betting period), did 
not provide statistically significant profits. 

Assuming that a bettor knows perfectly the true win odds of the horses in a race, 
Isaacs (1953) determined the betting scheme that maximizes expected profit. Isaacs 
accounted for the effect of one’s bets on the odds and determined an algorithmic solution 
to the non-linear optimization problem.* Rosner (1975) extended Isaacs by introducing 
logarithmic utility and a budget constraint, but ignored one’s effect on the odds. Levin 
(1994) also extended Isaacs in several ways, for example, by introducing non-linear 
utility and a budget constraint. The public’s win odds are just one source of informa- 
tion at the track. Others include the racing form, which provides past performances 
on all the horses entered in a race, predictions of expert handicappers that appear in 


?This clearly is not a rational expectations equilibrium since bettors are not revising their estimates on the 
basis of the price that is offered. 
3Below, we develop a similar optimization problem for place and show betting. 
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local newspapers, and tip sheets that can be purchased at the track. Semi-strong form 
efficiency can be studied by considering this information. Using a multinomial logit 
model to measure the information content of the forecasts of professional handicappers, 
Figlewski (1979) found their forecasts to contain considerable information but the pub- 
lic’s odds discount almost all of it, in support of semi-strong efficiency. Snyder (1978) 
and a reanalysis of Losey and Talbott (1980) also showed that following the advice of 
professional handicappers is generally unprofitable. For harness racing, Ludlow (1994) 
tested a fundamental system. He considered several classification schemes for separat- 
ing winners and nonwinners in one dataset, and then cross-validated them on another 
dataset. 

Bolton and Chapman (1986) developed another fundamental system using a multi- 
nomial logit model to estimate win probabilities with 10 factors such as weight, post 
position, past performances, and jockey. They were essentially unable to demonstrate 
any significantly positive profits, though, in support of semi-strong form efficiency. 
Chapman (1994) extended this work to a 20-variable model and increased the dataset to 
2,000 races from 200 in Bolton and Chapman (1986). Despite a very simple scheme 
of betting $1 on the horse with the highest expected return as long as that return 
was positive, some evidence of profits was shown. Chapman also showed that the 
public’s win odds contained considerable information beyond that contained in his 
20 factors, but he did not investigate how its inclusion in his model would improve 
profits. 

Benter (1994) provided some details of a very elaborate implementation of a multi- 
nomial logit model that is reported to have been successfully employed in Hong Kong 
for a number of years. Benter described the importance of defining factors that extract as 
much information as possible, stating that a model involving only simplistic specifica- 
tions of factors does not provide sufficiently accurate estimates of winning probabilities. 
An example of the complexity of the factors that Benter uses is seen in his discussion of 
a distance preference factor to indicate the horse’s demonstrated preference for a race 
of the distance that it will run in the upcoming race. For predicting races of 1—1.25 mi, 
Bolton and Chapman (1986) deal with the distance preference factor through a variable 
that equals one if a horse had run three or four of its last four races at distance levels 
of less than 1 mi, and zero otherwise (p. 1047). Benter (1994), as a result of a large 
number of progressive refinements, uses a variable defined as follows: 


[F]or each of a horse’s past races, a predicted finishing position is calculated 
via multiple regression based on all factors except those relating to distance. 
This predicted finishing position in each race is then subtracted from the 
horse’s actual finishing position. The resulting quantity can be considered 
to be the unexplained residual which may be due to some unknown distance 
preference that the horse may possess plus a certain amount of random error. 
To estimate the horse’s preference or aversion to today’s distance, the resid- 
ual in each of its past races is used to estimate a linear relationship between 
performance and similarity to today’s distance. Given the statistical uncer- 
tainty of estimating this relationship from the usually small sample of past 
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races, the final magnitude of the estimate is standardized by dividing it by 
its standard error. The result is that horses with a clearly defined distance 
preference demonstrated over a large number of races will be awarded a 
relatively larger magnitude value than in cases where the evidence is less 
clear. 


A horse’s post position is further information that is available to the bettor. Horses 
with inside post positions typically have an advantage in gaining positions near the 
rail during the race, a benefit that increases with the number of turns in the race; in 
fact, horses lose about one length every turn for each horse between them and the rail. 
Canfield et al. (1987) demonstrated that a post position bias exists favoring the inside 
posts and showed that the market correctly adjusted odds to reflect it, both in the 
win market and other markets. An exception is when the situation is nonobvious and 
the bet is complex, such as exacta wagering on rainy days when the inside bias may 
not be present.* Such situations may allow profitable betting strategies. Betton (1994) 
also demonstrated a post position bias that was only partially reflected in the mar- 
kets odds. She did not consider whether the bias was sufficient for profitable betting 
strategies. 

Numerous trade publications purport to offer profitable systems that violate semi- 
strong form efficiency. Among the more “scientific” are Mitchell (1986), Quinn (1986, 
1987, 1992), and Ziemba and Hausch (1986, 1987). Tests of strong-form efficiency 
are more difficult to conduct as they involve data that is generally not publicly avail- 
able. Self-proclaimed accounts of successful gambling exploits do exist, though (e.g., 
Beyer, 1978, 1993). Also, Schnytzer and Shilony (1995) show that observing inside 
information in the odds can be beneficial. (See also Shin, 1992, 1993.) 


3.3. Place and Show Markets 


In the place and show markets, bets are profitable if the horse is at least second and third, 
respectively. Define P; as the amount bet by the public on horse j to place and P = F, P; 
as the place pool. Similarly, S% is the public’s show bet on horse k and S = }, Sx is the 
show pool. Then the payoff per dollar bet on horse j to place is 


oe 


7P, (1) 
if horses i and j are the first two finishers (in either order). Otherwise the payoff is zero. 
The first term of Equation (1) is the return of the dollar that was wagered. The second 
term recognizes that the track returns only OP of the pool, and from that amount the 
original wagers on horses i and j are returned. Of the remainder, PQ — P; — P;, half 
goes to the bettors of j, who share it on a per-dollar-wagered basis. The payoff on j 
is independent of whether j finishes first or second, but does depend on which horse 


4Because tracks are beveled, rain may collect in the inside post positions. 
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it finishes with. A bettor on j to place generally prefers that a longshot, not a favorite, 
finishes with horse j. The payoff per dollar bet on horse k to show is similar: 


4, QS=Si~ Sj ~ Si 


1 
3S 


(2) 


if horses i, j, and k are the top three finishers." 

One approach to place and show betting is to study whether profitable rules for those 
markets can be devised based on information in the win market. For instance, given the 
win market’s bias for favorites, perhaps place and show bets on favorites—which are 
more likely to pay off than win bets—allow positive profits. Asch et al. (1984, 1986) 
studied this possibility using a logit model to estimate win probability based on the win 
odds, the marginal win odds (odds based on late betting only), and the morning line. 
Betting to win, place, and show on the horse with the highest win probability (together 
with additional screens), returns far exceeded those of the average bettor but were not 
positive with statistical significance. Betting to place or show based just on information 
in the win market can only be improved with some attention to information in the place 
and show markets, since it is the public’s wagering in those markets that alone determine 
place and show payoffs. This was the approach of Hausch, Ziemba, and Rubinstein 
(Hausch et al., 1981, hereafter HZR), who demonstrated that weak-form inefficiencies 
can be identified in the płace and show markets. There are three distinct aspects to their 
model: (i) determining place and show probabilities which together with Equations (1) 
and (2) allow expected returns to be calculated; (ii) using the optimal capital growth 
model for wagering; and (iii) using approximations for implementation in real time. All 
three aspects will be described. 


3.4. Place and Show Probabilities 


Suppose that horse i’s true win probability is (which could be determined on the basis of 
fundamental handicapping or, as HZR assumed, be derived from the public’s wagering 
as W/W). Then Harville’s (1973) formulas estimate the probability that horse i is first 
and horse j is second as: 


qij = . (3) 


Equation (3) follows from assuming that the likelihood that j will be second, conditional 
on i being first, is the probability that j would be first if i were not in the race, which can 


5Equations (1) and (2) ignore breakage, which is incorporated by rounding down a payoff to the nearest 
nickel or, more typically, dime on the dollar. A further consideration is that U.S. tracks generally guarantee 
winners a minimum profit of 5%. So, even if (SQ — S; — S; — S,)/3S; < 0.05, which is termed a minus 
pool, show bettors on i receive a 5% return. Minus pools are most common when there is an extreme favorite. 
In that event, Hausch and Ziemba (1990b) show how risk-free arbitrage may be possible in the place and 
show markets. 
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be estimated to be q;/(1 — q;). Similarly, Harville estimated the probability that horses 
i, j, and k finish first, second, and third, respectively, to be: 


G4j Qk 
ijk = > . (4) 
ae A -~q) -qi — qj) 
Harville’s formulas have also been discussed by Savage (1957) and Plackett (1975). 
With these ordering probabilities, place and show probabilities can be calculated. The 
probability that i places is the probability that i finishes first or second, which is: 


qi + $ ai (5) 
jži 


and the probability that i shows is: 


at Ya + DY arji (6) 


JAA kži ji 


Dansie (1983) and Henery (1981) demonstrated that Harville’s formulas, Equa- 
tions (3) and (4), are implied when horses running times are independently and 
exponentially distributed. If horse i’s running time, T;, is exponentially distributed with 
mean 1/A, then Equations (3) and (4) follow with q; = \i/ È}, ahi An alternative deriva- 
tion supposes that T; has the extreme-value distribution with location parameter, that is, 
i’s running time has density function: 


Silti) = exp(t; — 0;) exp[—exp(t; — 0;)], ~% < t; < % 


Then, if Ti, ..., T, are independent, Equations (3) and (4) follow with q; = exp(@;)/ 
b2 j exp(—0;). These two derivations are related because the extreme-value distri- 
bution is the logarithm of an exponential distribution, and the logarithm preserves 
order. 

Underlying probability distributions not generating Equations (3) and (4) have been 
considered, too. Henery (1981) considered running times, 7;, that are independent and 
normally distributed as Equation (1). Then 


qij = P(T, < T; < min [T4]) 
k#i,j 


= | De +e- 0) [T [1 - DE +0- oo] oar, 


= kžij 


where ®(-) and ġ(-) are the standard normal cumulative distribution function (cdf) and 
density function, respectively. A similar expression holds for qi;,. Thus, with (0;), the 
ordering probabilities involve numerical integration. With (q;), possibly on the basis 
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of the public’s win bets or from handicapping, (6;) is the solution of the following 
non-linear system of equations: 


qi = PT; < T; < min [7)}) ae Per 
jÉi 


= j [] be 44 -0o ar. 


=O k žij 


Stern (1990) proposed another alternative to the Harville formulas. He assumed that 
running times are independent and gamma distributed with fixed shape parameter r. 
Such an underlying distribution is descriptive of a game where players score points 
according to independent Poisson processes and the winner is the first to score r points. 
With r = 1, Stern’s model is Harville’s model, and as r > œ, it converges to Henery’s 
model. 

Several empirical studies have considered these various models. Henery (1984) fit 
running times to the extreme-value distribution, finding that it best fit the faster running 
times. Harville (1973) showed that his formulas tend to over estimate the probability 
that favorites finish second or third and underestimate these same probabilities for long- 
shots. This reverse favorite-longshot bias for the likelihood that horses finish second 
and third was also observed by HZR. The betting data in these two papers exhibits the 
favorite-longshot bias. One might conjecture that a natural consequence of this bias for 
first position is the reverse favorite-longshot bias for other positions, like second and 
third, as probabilities over a horse’s finish positions must sum to one. However, Benter 
(1994), using Hong Kong data that does not exhibit the favorite-longshot bias, showed 
this conjecture to be incomplete at best in Hong Kong. Tables 3-5 show his results. 


TABLE 3 Public’s Estimate of Expected Win Probability Versus Actual Win 
Frequency (Based on 3,198 Races, Royal Hong Kong Jockey Club, September 
1986-June 1993) 


Range of estimated Horses Mean expected Actual win 

probabilities within range win probability frequency observed Z-statistic 
0000-0.010 1,343 0.007 0.007 0.0 
0.010-0.025 4,356 0.017 0.020 1.3 
0.025-0.050 6,193 0.037 0.042 2.1 
0.050-0. 100 8,720 0.073 0.069 -1.5 
0.100-0.150 5,395 0.123 0.125 0.6 
0.150-0.200 3,016 0.172 0.173 0.1 
0.200-0.250 1,811 0.222 0.219 -0.3 
0.250-0.300 1,015 0.273 0.253 -14 
0.300-0.400 716 0.339 0.339 0.0 
>0.400 312 0.467 0.484 0.6 


Source: Benter (1994). 
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TABLE 4 Harville Calculation of Expected Probability of Finishing Second Versus 
Actual Frequency (Based on 3,198 Races, Royal Hong Kong Jockey Club, September 


1986-June 1993) 


Range of estimated 
probabilities 
0.000-0.010 
0.010-0.025 
0.025-0.050 
0.050--0.100 
0.100-0.150 
0.150-0.200 
0.200-0.250 
0.250-0.300 
0.300-0.400 
>0.400 


Source: Benter (1994). 


Horses 
within range 


962 
3,449 
5,253 
7,682 
4,957 
3,023 
1,834 
1,113 
1,011 

395 


Mean Harville 
probability of second 


0.007 
0.018 
0.037 
0.073 
0.123 
0.173 
0.223 
0.272 
0.338 
0.476 


Actual second finish 
frequency observed 


Z-statistic 


199 


0.010 
0.030 
0.045 
0.080 
0.132 
0.161 
0.195 
0.243 
0.317 
0.372 


0.9 
53 
2.8 
2.3 
1.9 
-1.8 
-3.0 
-2.3 
-14 
—4.3 


TABLE 5 Harville Calculation of Expected Probability of Finishing Third Versus 
Actual Frequency (Based on 3,198 Races, Royal Hong Kong Jockey Club, September 


1986-June 1993) 


Range of estimated 
probabilities 
0.000-0.010 
0.010-0.025 
0.025-0.050 
0.050--0.100 
0.100-0.150 
0.150-0.200 
0.200-0.250 
0.250-0.300 
0.300-0.400 
>0.400 


Source: Benter (1994). 


Horses 
within range 


660 
2,680 
4,347 
6,646 
4,325 
2,923 
1,831 


Mean Harville 
probability of third 


0.007 
0.018 
0.037 
0.073 
0.123 
0.173 
0.223 
0.273 
0.341 
0.492 


Actual third finish 
frequency observed 


0.009 
0.033 
0.062 
0.087 
0.136 
0.178 
0.192 
0.213 
0.273 
0.333 


Z-statistic 


0.5 
4.3 
6.8 
4.0 
2.5 
0.7 
-3.4 
—4.9 
-5.3 
-8.3 


Table 3 categorizes horses by the public’s estimate of their win probability (through the 
win odds) and compares that estimate to the actual win frequency. There is no obvious 
bias. Tables 4 and 5 calculate Harville’s estimate of finishing second and of finishing 
third, respectively, and compare those with actual frequencies. The reverse favorite- 
longshot bias is clearly exhibited, and is more pronounced for third position than for 
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second. Benter’s explanation of this bias is that Harville’s formula does not recognize 
the increasing randomness of the contests for second and third place. 

Lo (1994) provided a theoretical basis for this bias, demonstrating that Harville’s 
formulas produce such a bias if the underlying distribution of running times is indeed 
independent gamma. HZR described a special type of late-charging horse that tends 
to either win or finish far back in the field, making a second or third place finish 
unlikely. For such horses, this “Silky Sullivan” phenomenon (named after a horse that 
displayed this racing pattern) provides another explanation of the probability bias for 
second and third. Stern (1994) described this as a type of information contained in 
the horse’s failure to win that j is not used to adjust the probabilities, and related it 
to the memoryless property of the exponential distribution for running times. Stern 
(1990) analyzed 47 races and found that ordering probabilities estimated using r = 1 
(Harville’s model) were less accurate than those estimated using r = 2. Using a likeli- 
hood approach and Japanese wagering data, Lo (1994) found r = 4 to be best for Stern’s 
model. He reported r = œ (Henery’s model) was best on his Meadowlands and Hong 
Kong data, though. Thus, while one running-time distribution model does not appear 
to hold universally, there is limited empirical support for Harville’s model. (Other 
empirical studies include Bacon-Shone et al., 1992a,b; Lo, 1994; and Lo and Bacon- 
Shone, 2008.) 

While Henery’s (1981) and Stern’s (1987) ordering probabilities are superior to 
Harville’s, the complex numerical calculations that they both require essentially pre- 
cludes them from being used on-track (unless one determines one’s own odds in advance 
of the race). Henery suggested using a first order Taylor series approximation, but 
Bacon-Shone et al. (1992b) demonstrated its inaccuracy. Stern (1994) also mentioned 
that this simplification does not seem to improve on Harville’s model. 

Thus, Harville’s model still is useful; in particular for place and show probability at 
tracks where the favorite-longshot bias is exhibited in the win market. The probability 
of place, say, involves adding the probability of first, for which there is a favorite- 
longshot bias, and the probability of second, which has a reverse bias when calculated 
using Harville’s formula. These biases tend to cancel each other, as HZR demonstrated. 
Lo and Bacon-Shone’s (2008) discount model retains the simplicity of the Harville 
model but attempts to directly correct its bias for ordering probabilities. They define 
LO(i, j|k) = logarithm of the odds that horse i beats j for the kth position given that 
neither i nor j finish in the top k — 1 positions. For Harville this is simply log(q;/q,), 
independent of k. Lo and Bacon-Shone assumed instead that 


LOG, j|k) = LOG, j|k), 


and further assumed that A, is decreasing in k, presuming that the relative abilities of 
the horses matters less as the finish position worsens and the prize money decreases. 
The parameters ^z and à; can be estimated to best fit with Henery’s or Stern’s models, 
or estimated to best fit data from a particular track, from which simple ordering proba- 
bilities qj; can be calculated. For example, Benter’s (1994) Hong Kong data generated 
values of M2 = 0.81 and à; = 0.65, and Lo et al. (1995) used A» = 0.88 and à; = 0.8 
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for their Japanese data. They reported an improved fit between actual and expected 
frequencies. 

For any of these ordering probability models, the expected return on a bet on horse i 
to place and show can, respectively, be calculated using Equations (1) and (2) as: 


OP-P.-P, 
EXP; = $ (ais + 9) (1 + ora), (7) 
jži i 
and 
QS ~ S; — S; — Sk 
EXS; = ` DCT + Qjik + jki) (1 + aes) (8) 


kžj iti 


Using Harville’s model and Equations (3) and (4), HZR studied Equations (5) and 
(6) with data from the 1978 summer season at Exhibition Park (1,065 races and 9,037 
horses over 110 days) and the 1973-1974 winter season at Santa Anita Racetrack (627 
races and 5,895 horses over 75 days). Table 6 shows the results of wagering on bets 
that, through Equations (5) and (6), are identified as having certain minimum levels of 
positive expected return. The results strongly suggest that inefficiencies can be iden- 
tified in the place and show markets. Furthermore, HZR found that such inefficiencies 
appear about 2—4 times per racing day. Additional calculations along these lines appear 
in Harville (1973). 


3.5. Optimal Capital Growth 


The previous section showed evidence of inefficiencies in the place and show markets. 
The second aspect of HZR’s model is to determine the bet size upon identifying a profit- 
able wager. HZR employ the optimal capital growth model, or the Kelly criterion, which 
maximizes the expected logarithm of wealth on a race-by-race basis. It was devel- 
oped by Kelly (1956) for information transmission independently developed by Latané 
(1959), extended by Breiman (1961) who provided rigorous proofs of the main results, 
and is described in detail by Hakansson and Ziemba (1995).’ Among its properties are: 
(1) it maximizes the asymptotic growth rate of capital; (2) asymptotically, it minimizes 
the expected time to reach a specified wealth goal; and (3) it outperforms in the long run 
any other essentially different strategy almost surely. While these are impressive asymp- 
totic properties, Aucamp (1993) asked: how many periods are required to be reasonably 
confident that the Kelly criterion will be superior? Both theoretical and experimental 
evidence point to the need for only a moderate number of plays when the risk is low but 


6See HZR for Equations (5) and (6) adjusted for breakage. Table 6 correctly accounts for the adverse effects 
of breakage. 

7See also Algeot and Cover (1988) and Thorp (2006) for a more general mathematical treatment, and 
MacLean et al. (1992) and MacLean and Ziemba (2006) for fractional Kelly strategies and a comparison 
of its properties. 
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TABLE 6 Results of Betting $1 at Exhibition Park and Santa Anita to Place and 
Show on Horses with a Theoretical Expected Profit of at Least a Specified Minimum 


Level 
Place Show 
Minimum Number Total net Rate of Number Total net Rate of 
evel of bets profit ($) return ($) of bets profit ($) return ($) 
Exhibition Park 
-04 225 5.10 2.3 612 33.20 5.4 
1.08 126 —10.10 —8.0 386 53.50 13.9 
1.12 69 11.10 16.1 223 40.80 18.3 
.16 40 5.10 12.8 143 26.30 18.4 
.20 18 5.30 29.4 95 21.70 22.8 
25 1l -2.70 -24.5 44 11.20 25.5 
1.30 3 —3.00 —100.0 27 10.80 40.0 
.50 0 — — 3 6.00 200.0 
Santa Anita 
1.04 103 12.30 =11,9 307 ~18.00 -5.9 
.08 52. 12.80 24.6 162 6.90 43 
12 22 9.20 41.8 89 3.00 3.4 
16 7 2.30 32.9 46 12.40 27.0 
1.20 3 —1.30 -43.3 27 6.20 23.0 
i25 0 — — 9 6.00 66.7 
.30 0 — — 5 5.10 102.0 
1.50 0 = — 0 — — 


Source: Hausch et al. (1981). 


a large number generally when the risk is high. See also Ziemba and Hausch (1986) for 
empirical calculations. 

Using Equations (5) and (6), assuming the appropriateness of Harville’s formu- 
las, introducing bets to place and show as decision variables, and accounting for 
one’s effect on the odds (but ignoring breakage), the optimal capital growth model 
solves optimization problem 1 (OP1). Notice that OP1 considers each possible 1-2-3 
finish of the horses, determines the logarithm of final wealth in that event, and aver- 
ages over all possible 1-2-3 finishes. For convenience, OP1 defines P; = P, + P; and 
Sije = S; + S; + Sg. OP! also involves a budget constraint with wo representing current 
wealth. 


8Kallberg and Ziemba (1994) described OP1’s generalized concavity properties. 
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Optimization problem 1 is shown as follows with place and show betting with the 
optimal capital growth model. 


O(P+Y i) pr) — (pit pj+ Bi) 


Pi Pj 
x Ee + | 
O(S+ D7, 51) — (sit sj +5. tSijx) 


n n n 
9i9j 9k + 
max ————— lo 3 
Pisi pap ip» (1 -qD — qi — q;) 8 e | si 5j 5k 


j=l] ji k=i 


jži kij Si+S; sjt+S; StS, 
n n 

+w- È it DY p 
i=1 ł=l 
ižij,k tžij 


n 


s.t. È (pi + s1) < wo, P20, s20, l=1,...,n. 
Izl 


3.6. Implementing the System and Empirical Results 


HZR and Hausch and Ziemba (1985) provided tests of OP1 on data from three North 
American racetracks. Figures 3, 4, and 5 show, respectively, the wealth level histo- 
ries for the 1978 season at Exhibition Park (Vancouver, BC), the 1973-1974 winter 
season at Santa Anita (Arcadia, CA), and the 1981—1982 winter season at Aqueduct 
(Jamaica, NY). Given the many approximations involved in the system, the authors 
established cutoffs on the expected return of a bet (see Equations (5) and (6)) that were 
necessary for wagering, cutoffs that decrease with the size of the track, and one’s confi- 
dence with the accuracy of the public’s win odds. Specifically, Figure 3 for Exhibition 
Park, a relatively small track, uses an expected return cutoff of 1.20. Figure 4 for Santa 
Anita, a much larger track, uses a 1.16 cutoff, and Figure 5 for Aqueduct uses 1.14. The 
track take is generally established by the state or province, and varied across these three 
tracks. For the seasons studied, the track takes were 18.9% for Exhibition Park, 17.5% 
for Santa Anita, and 15% for Aqueduct. To appreciate the dramatic long run effect of 
the take, Figure 5 also considers takes of 14% and 17%. 

Figures 3-5 are based on OP1’s use of final public wagers, that is, the presump- 
tion was that our bettor could wager last. In practice, that is not possible because of 
three time-consuming activities necessary for implementing OP! in real time. First is 
the input of the required data—the public’s win, place, and show bets on all horses 
in a race. This data is on the order of 3 four- or five-digit numbers for, perhaps, 
10 horses. Second is the solution of the non-linear optimization problem OP1. And third 
is the time necessary to make one’s bets before the end of the betting period. Since the 
public’s wagering can and commonly does change over the betting period, these three 
activities mean that one must work with data that only imperfectly forecast the eventual 
payoffs. To reduce the time involved in the first two activities, HZR developed regres- 
sion approximations to the solution of OP! that, once a horse was identified as being 
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FIGURE 3 Wealth level history, Exhibition Park, 1978 season. Results from expected log betting to 
place and show when expected returns 1.20. Initial wealth is $2,500, track payback is 81.9%, and breakage is 
accounted for. Source: Hausch et al. (1981). 
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FIGURE 4 Wealth level history, Santa Anita, 1973—1974 season. Results from expected log betting to 
place and show when expected returns 1.16. Initial wealth is $2,500, track payback is 82.5%, and breakage is 
accounted for. Source: Hausch et al. (1981). 
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FIGURES Wealth level history. Aqueduct, 1981-1982 season. Results from expected log betting to place 
and show when expected returns 1.14. Initial wealth is $2,500, breakage is accounted for and track paybacks 
considered are 83%, 85% (actual), and 86%. Source: Hausch and Ziemba (1985). 


a possible bet, required as inputs only the public’s win and place (or show) wagers on 
that horse as well as the win and place (or show) pools. Analyzing a small sample of 
races with data 2 min prior to the end of betting, HZR found that the problem of using 
such data is limited—bets identified as profitable based on odds 2 min before the end 
of the betting period generally remained profitable based on the eventual final odds. 
Hausch and Ziemba (1985) and Ziemba and Hausch (1987) improved upon HZR’s 
regression approximations and introduced additional approximations for multiple horse 
entries and multiple bets. Programmed into a speciality calculator, entry of the required 
data and calculations takes about 15 sec. 

Hausch et al. (1995) tested these approximations on 62 runnings of the Kentucky 
Derby (1934-1995). The anomaly was present during this period and $2,500 grew to 
$8,002. With a breeding filter (based on dosage theory), wealth increased to $12,508. 
See Bain et al. (2006) and Gramm and Ziemba (2008) in this volume for updates of this 
and applications to the Preakness and Belmont Stakes. 

Ritter (1994), in a revision of work predating HZR, also considered place and show 
betting. Instead of computing expected returns, Ritter used various filter rules to select 
place and show bets. He achieved positive profits using final odds, but not when wagers 
were based on the odds 1.5 min from the end of betting. 
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Lo et al. (1995) demonstrated that OP 1s performance can be improved by replacing 
Harville’s model with Lo and Bacon-Shone’s (2008) discount mode} that corrects for 
biases in Harville’s model. 


3.7. Does the System Still Provide Profits? 


We have a final odds place and show wager simulation; see Ziemba (2004). The effect 
of bets on payoffs was not considered. For 4,879 races from major U.S. tracks, there 
were 752 wagers made including full, half, and third Kelly wagers. The initial bankroll 
was $10,000 and yielded final bankrolls of $55,757 with full, $32,878 with half, and 
$24,100 with third Kelly. The assumptions were 


No rebates. 

Expected value cutoff = 1.14 for large tracks and 1.18 for small tracks. 

For all tracks evaluated, no bets were made when the odds were greater than 8.00/1. 
All surfaces and track conditions were included. 

All classes of races were included. 

Coupled entries were used and optimized. 

Place and show bets on the same horse were optimized. 

Bets on multiple horses in the same race were not optimized. 


Figure 6 shows the wealth path. The odds of the favorite, and often our place and/or 
show bet, tended to fall as the betting period ended. But more and more of the betting 
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FIGURE 6 Wealth path of place and show bets on 752 wagers from 4,879 races in 2003. 
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actually is not posted until after the race has begun. By 2008 this was about 50% largely 
because about 87% of the total bets for a typical U.S. racetrack are made off track. This 
money is bet before the race begins, but the time lags in its reporting in the pools from 
the other track betting and from professional betting near the start of the race causes 
this effect and significant odds changes from the off-track prices. A typical appliction in 
2006 with rebates provided a loss of about 7% over 80 U.S. racetracks with an average 
of about 9% rebate for a profit of about 2%. A test account with $5,000 grew to $30,000 
with total betting of about $1,500,000 over eight months. 


3.8. Exotic Markets 


For at least two reasons, exotic bets are typically more complicated to assess than 
bets to win, place or show: (1) they involve the outcome of two or more horses; and 
(2) one often has poor access to information about the public’s exotic wagering (in 
fact, some tracks display no exotic wagering data). The reason for not fully displaying 
exotic data is entirely due to the large quantity of it. For example, in a 10 horse race, 
the public’s win, place, and show bets involve 30 numbers. By comparison, there are 
45 quinella numbers, 90 exacta numbers, and 720 trifecta numbers. Exotic wagers tend 
to be popular with the public. One attraction is their low probability and high payoff 
characteristics. In view of the favorite-longshot bias for win bets, it is not surprising 
that longer-odds events than win bets will be heavily wagered. To illustrate a second 
attraction, consider the daily double. Bettors can create for themselves a daily double 
by wagering to win on a horse and then, if successful, betting all the proceeds to win on 
a horse in the next race. This self-constructed daily double, called a parlay, differs from 
a daily double in an important way. The parlay is subjected to the track take twice while 
the daily double pays it only once. Thus, transaction costs are higher with the sequence 
of win bets. A successful handicapper needs skills significantly better than those of the 
average bettor. As transaction costs decrease, though, success demands less of a skill 
advantage or, for the same skill advantage, profits are greater.? Most tracks appreciate 
this and charge a higher track take on exotic wagers, reducing their advantage. 

While a parlay that is a self-constructed version of a daily double pays the track 
take twice, it does allow the bettor more information. Since the daily double must be 
wagered before either race, the bettor has little information about the public’s view of 
the second race (other than any information that can be gleaned from the payoffs that are 
offered on daily double combinations, but that information is usually difficult to access). 
The parlay allows one to wager on the second race with a better sense of the public’s 
impressions of the horses. Ali (1979) found that returns of parlays and double bets 
were not significantly different. Asch and Quandt (1987), however, found that doubles 
are statistically more profitable than parlays. When parlay payoffs are adjusted as if 
parlay bettors paid the track take just once, then returns on parlays and doubles are 
not significantly different. Lo and Busche’s (1994) conclusions were similar for Hong 
Kong data. 


°For more details on this effect and a numerical example, see Benter (1994). 
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Asch and Quandt (1987) found some support for the notion that the smart money is 
in the exotic pools. The basis for this notion is that the informational content of the smart 
money is more difficult for the public to discern in the exotic market than it would be if 
it were wagered in the win market. Their analysis ignores the systematic biases of the 
Harville (1973) model, though. Also, Dolbear (1993) described a further bias in their 
comparison of the theoretical and subjective probabilities of exacta outcomes. Bacon- 
Shone et al. (1992b) addressed these concerns and concluded that the public’s exacta 
betting provides more accurate estimates of ordering probabilities (the probability that i 
wins and finishes second) than does the win market, and this is so using any of the prob- 
ability models offered by Harville (1973), Henery (1981), or Stern (1990). Similarly, 
the trifecta market provides more accurate estimates of their ordering probabilities than 
does the win market. 

Asch and Quandt (1988) analyzed the unbiasedness of the probabilities implied by 
exacta and trifecta bet fractions. Using simple linear regressions on the relationship 
between objective probabilities (estimated by win frequencies) and subjective probabi- 
lities (average bet fractions), they drew different conclusions for the two pools. Exacta 
pools implied probabilities that appeared to be unbiased while the trifecta bet fractions 
more weakly approximated the objective probabilities and some clear over/underbetting 
bias was exhibited. 

Ziemba and Hausch (1986) adapted for exotic wagering the techniques developed 
by HZR for place and show wagering: win probabilities from the efficient win mar- 
ket, Harville’s (1973) model to price other wagers and the Kelly criterion to determine 
wagers. Hausch et al. (1994b, 2008) developed general formulas for optimal wagering 
on exotic bets, allowing ordering probabilities based on Harville (1973), Henery (1981), 
or Stern (1990) with the help of approximations developed by Lo and Bacon-Shone 
(1993). Quinella data on 369 Hong Kong races was used to illustrate the system. 

Kanto and Rosenqvist (1994) developed a betting system for quinella bets (called 
double bets in Finland) at a Finnish racetrack. Instead of using the win odds data 
directly, they used maximum likelihood estimation and Harville’s (1973) model to 
estimate the win probabilities and the probabilities associated with a quinella bet by 
assuming that the quinella bet amounts for different combinations follow a multinomial 
distribution. Using the Kelly criterion for wagering and 111 races, they showed some 
evidence of positive profits. 


3.9. Cross-Track Betting 


Cross-track betting allows bettors to wager at their track (a cross track) on a race being 
run at another track (the home track). Since each track operates a separate pool, the 
payoffs at the tracks can differ. And, in fact, they do often differ, sometimes quite 
dramatically. For instance, a $2.00 win ticket on Ferdinand, the winner of the 1986 
Kentucky Derby, paid $16.80 at Hollywood Park in California, $79.90 at Woodbine in 
Toronto, and $90.00 at Evangeline in Louisiana. Using data from Triple Crown races, 
Hausch and Ziemba (1990a) developed and tested optimal betting strategies for cross- 
track betting to exploit different odds across tracks. One strategy identified whether 
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a risk-free hedge could be developed by betting each horse at the track where its offered 
odds were longest, in relative amounts that guarantee a profit. Examples where the vari- 
ance in odds across tracks was sufficient were provided. Also analyzed was the optimal 
capital growth strategy. This latter strategy was studied in two environments: (1) a sin- 
gle bettor at a cross track observing (perhaps by television) the home tracks odds; and 
(2) a syndicate of bettors, one at each track, communicating with each other. 

Leong and Lim (1994) also found evidence of profits using cross-track betting that 
exists between races in Singapore and Malaysia. Both papers showed profits but neither 
had sufficient data for statistically significant profits. Thus, further work is needed to 
understand cross-track betting, particularly as it is becoming more popular. Tracks have 
been facing declining attendance, due in part to the increase in other forms of gam- 
bling, for example, lotteries. Cross-track betting helps attendance by offering bettors 
the opportunity to wager on prestigious horses in races at large tracks. Since the cross 
track does not need to stage an expensive race and the home track receives a portion of 
the cross track’s revenues, both tracks can benefit from this arrangement. 


4. THE FOOTBALL BETTING MARKET 


Bettors on National Football League (NFL) games are offered a point spread. For exam- 
ple, suppose team A is a 10 point favorite over team B. Then a bet on A pays only if 
A wins by at least 11 points while a wager on B pays only if B either wins or loses by 
fewer than 10 points. If A wins by exactly 10 points, then wagers are usually refunded. 
Typically bettors pay $11 for a $10 profit when they win. This provides the bookmaker 
a commission and means that a bettor has to beat the spread 52.4% of the time to break 
even. When the actual point spread equals the offered point spread, the bookmaker 
receives no return. Otherwise, by perfectly balancing the wagers, a bookmaker can guar- 
antee a profit of 4.55% (since $21 is paid for each $22 wagered). The Las Vegas sports 
books, which dominate the market, offer opening point spreads on the coming week’s 
game. These spreads may change over the week, but bettors receive the spread offered 
at the time they placed their bet.!° 

The efficiency of NFL betting rests on the accuracy of the point spreads. An obvious 
and common approach to study their accuracy is to regress the actual point spread on 
the offered point spread. If, say, bettors tend to wager on underdogs then, to balance the 
books, the bookmaker has to offer point spreads lower than unbiased expectations about 
actual point spreads. Bettor biases should be reflected in the point spread offered and, 
if they are sufficiently large, should allow profitable betting opportunities, which would 
reject efficiency. 


'OThese dynamics are also present in horse race wagering against bookies. Wagering on jai alai is similar 
(see Lane and Ziemba, 2004, 2008), too, but its odds change during the contest as points are scored rather 
than before the contest as with sports betting. Pari-mutuel betting is different, though; its odds change over 
the course of the betting period as betting patterns change, but payoffs to all bettors are based only on the 
final odds. 
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Let A, be the actual point spread in game i and let P;, be the point spread offered. 
The following equation can be estimated: 


A; = Bı + B2P; +6, (9) 


where e; is the error term. The efficiency test is the joint hypothesis that B; = 0 and 
B2 = 1. Pankoff (1968), Zuber et al. (1985), and Sauer et al. (1988) all found signifi- 
cant support for the hypothesis. Gandar et al. (1988) results are similar for both opening 
and closing point spreads (the point spread can change over the betting period as book- 
makers attempt to balance their books). Their large t-statistic on Bz and low R? (3.4% 
for closing data) suggest that while the point spread for any particular game is a poor 
predictor of the actual point spread, it is a good predictor of the average actual point 
spread for a group of games with this point spread. 

While these results support market efficiency, they are not directly useful in answer- 
ing whether there might be technical rules that are economically profitable. Vergin and 
Scriabin (1978) used NFL data from 1969 to 1972 to consider various rules, such as bet- 
ting on the underdog when the point spread exceeds some specified level, and identified 
several profitable strategies. Using 1975-1981 data, Tryfos et al. (1984) demonstrated 
that most of these strategies were unprofitable or, if profitable, not at the 5% significance 
level. Those that were significantly profitable all required a syndicate taking advantage 
of different point spreads in different cities. Gandar et al. (1988) found similar negative 
results for these strategies using their 1980-1985 data. 

Golec and Tamarkin (1991) discussed how a model such as Equation (7) can mask 
specific biases: consider that B; measures the average of the biases that do not change 
with the magnitude of the point spread. If half the observations in a data sample include 
a positive bias and the other half a negative bias of equal magnitude, then B; = 0 
(p. 314). The problem is that Equation (7) can deal with only one bias. For instance, 
a bias in favor (or against) the home team can be considered by defining the data, P,, 
relative to the home team. But if there is also a bias for (or against) the favored team, 
that can confound measuring the home team bias. To specifically test for possible biases 
for the favorite and the home teams, Golec and Tamarkin used the following model: 


A; = Bi + BoP + B3 H; + BaF +e, 


where H; is a dummy variable that is one for home teams and zero otherwise, and F; 
is another dummy variable that is one if the team is favored and zero otherwise. Here, 
the test of efficiency is that Bı = B3 + B4 = 0 and B2 = 1. Their empirical results for 
NFL games from 1973 to 1987 indicated that bettors tend to underestimate the home 
field advantage and overestimate the distinction of being the favorite. Interestingly, they 
showed that the home field bias is disappearing over time while the underdog bias 
is actually growing. Despite demonstrating these biases, profits are shown to be slim 
at best in the face of the bookmaker’s commission. Neither bias is present in college 
football. 

Sauer et al. (1988) considered explanatory variables beyond the point spread, such as 
the number of wins prior to this game, fumbles, interceptions, penalties, yards passed, 
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and so on. Regressing these variables on the difference between the offered and the 
actual point spreads, they were unable to reject the hypothesis that their coefficients 
are all jointly zero. They concluded that these variables add essentially no information 
beyond that already in the point spread. Dana and Knetter (1994) allowed two modi- 
fications. Since fumbles, interceptions, and penalties affect the game but are relatively 
uninformative about a team’s ability, they accounted for these unsystematic sources of 
noise. Further, they used a non-linear function of past point spreads. There is scant sup- 
port for any of their models achieving the minimum 52.4% winners needed for profitable 
wagering. 

What is the probability that a team favored to win a football game by p points does 
win the game? Stern (1991) showed that the margin of victory for the favorite is approxi- 
mately normally distributed with mean equal to the point spread and standard deviation 
estimated at 13.86. The probability of winning a game is then 


—P p 
Pr(F > UIP =p) =1-N(—2-)=(—_) 
MTIR RANSA 13.86 / 7 \ 13.86 
where F and U represent actual points scored by the favorite and the underdog, respec- 
tively, and N(-) is the standard normal cumulative distribution function. A linear 
approximation to the probability of winning is 


Pr(F > U|P = p) = 0.50 = 0.03p. 


This formula is accurate to within 0.0175 for p < 6 and is based on data from the 1981, 
1983, and 1984 NFL seasons. Data from 1985 and 1986 indicate that the normal approx- 
imation is valid outside of the original dataset. This approximation is useful for a variety 
of applications, for example, estimating the probability distribution of games won by 
a team, the probability a team makes the playoffs, and the probability distribution of 
season or playoff outcomes for particular teams. 


5. THE BASKETBALL BETTING MARKET 


Do athletes have performance that runs in streaks? Gilovich et al. (1985) using data 
from the 1980-1981 season for the Philadelphia 76ers found that consecutive shots, 
if anything, were negatively autocorrelated. Hence there is no hot hand. They also let 
college players take shots while the players and other observers bet on the outcomes. 
Both players and observers made larger bets after players had just made shots, although 
bet size and actual performance were uncorrelated.!! Camerer (1989, p. 1257) argued 
that belief in the hot hand is a mistake generated by the persistent misunderstanding 
of randomness. People usually expect more alternations and fewer long streaks than 
actually occur in random series. 


11 Albright (1993) studied hitting streaks of baseball players and found no evidence of streaks beyond those 
expected by a statistical model of randomness. Comments on Albright’s work and a rejoinder follow the 
Albright (1993) article in the same journal issue. 
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If the hot hand is believed to exist within a game then bettors might also believe 
in hot and cold streaks across games. And if point spreads reflect a mistaken belief in 
hot hands, then winning-streak teams should do worse than expected. For NBA regular 
season games from 1983 to 1986, Camerer (1989) found this effect to be very weak. 
The effect for losing streaks is slightly stronger, but in neither case is the bias sufficient 
to overcome the bookmaker’s transactions costs. Camerer’s test is premised on the myth 
of the hot hand. Using more data and a test that can also detect the presence of the hot 
hand, Brown and Sauer (1993b) demonstrated that the market believes in the hot hand. 
Neither the hypothesis that the hot hand is real nor that it is a myth could be rejected, 
though. 

Brown and Sauer (1993a) examined the error term in a point spread pricing 
model. While the model’s ex ante predictions explained 85% of the variation in point 
spreads, the error term has significant predictive power. Hence, the error term contains 
unobserved fundamentals, not just noise. 

Sauer (1991) showed that the Las Vegas market point spreads offered at 5 PM 
Eastern time on the day of 5,636 NBA games are an unbiased estimate of the actual 
difference in scores. In a subsample of 700 games that involved injuries to star play- 
ers, the teams with the injured stars performed more than a point worse than the 
point spread. Obviously this is a nonrepresentative sample, though, because it con- 
sists of the games in which the injured star did not play, but not the games where the 
injured player decided after 5 PM to play. Accounting for the likelihood that a star 
with a nagging injury will play, the point spreads provided unbiased estimates of actual 
outcomes. 


6. LOTTERIES 


6.1. Introduction to Lotteries 


For thousands of years, choosing by lots has been used as one means of resolving 
disputes. The first lottery of a more traditional form, where one pays for a chance to 
win, dates at least to the Middle Ages in Italy, studied by Ziemba, Brumelle, Gautier, 
and Schwartz (Ziemba et al., 1986, hereafter ZBGS). Prior to this century, lotteries 
were successfully used in the U.S. for local and state governments, and to fund numer- 
ous causes, such as universities. Corruption, fraud, and moral opposition together with 
lottery restrictions imposed by Congress ended legalized lotteries by the end of the 
nineteenth century, with 35 states going so far as to explicitly prohibit them in their 
constitutions (Clotfelter and Cook, 1991, p. 38). State lotteries continued to be non- 
existent! until 1964 when New Hampshire introduced its lottery. Since then, the U.S. 
has seen an explosive resurgence of lotteries. In 1991, Washington, DC and 32 states 
offered lotteries. Furthermore, ticket sales across states exceeded $19 billion. Of that 


12Other lottery possibilities were available, such as charity raffles, foreign lotteries like the Irish Sweepstakes, 
and illegal lotteries. 


Chapter 21: Efficiency of Racing, Sports, and Lottery Betting Markets 493 


William T. Zie 


213 


amount, prizes were $10.4 billion, administration including advertising was $1.1 bil- 
lion, and net revenue was $7.6 billion.'? Clotfelter and Cook (1990) mention that in the 
course of a year, 60% of the adults who live in lottery states play the lottery at least once 
(p. 105). They also report that per capita sales in lottery states increased from (in 1989 
dollars) $22 in 1975 to $108 in 1989 (p. 105). The present popularity of lotteries is more 
widespread than just the U.S.; in 1986, over 100 countries offered legalized lotteries 
(ZBGS, 1986, p. 2). See ZBGS for more on the history and the practice of lotteries. 

Despite their popularity, with expected returns typically of 40-60%, lotteries are 
usually a poor investment.!* This range is even lower (10-20%) if prizes are not tax- 
free or if they are paid in installments over, say, 20 years, as they typically are in the U.S. 
Canadian and UK prizes are paid in cash and are tax-free. (See ZBGS for calculations 
on the effects of tax and payment in installments.) 

Lotteries take several forms. A simple version has players buy prenumbered tickets 
followed by a random drawing. Instant scratch-off games allow one to determine imme- 
diately if a prize has been won. Another form is the numbers game that requires players 
to match a randomly generated three- or four-digit number. Lastly, players of lotto 
games attempt to match five to seven numbers (with six most common) drawn from 
50 or so numbers (with 49 most common), with the actual choice of the parameters 
varying state by state. A feature distinguishing the numbers and lotto games from the 
other two forms is the player’s act of choosing his or her numbers. For reasons not 
easily explained by traditional economics, the feature of choice is of tremendous impor- 
tance. This was illustrated by Langer (1975) who conducted two lotteries where tickets 
cost $1 and all the money collected was awarded to the winner, that is, the payback 
was 100%. Players in the first lottery were assigned their tickets while those in the 
second lottery chose theirs. As the winner was randomly drawn, subjects in both lot- 
teries had the same chance of winning. However, Langer found that ticket holders in 
the two lotteries viewed their situations differently. When individually approached to 
sell their tickets before the drawing, those in the first lottery demanded a mean payment 
of $1.96, while in the second lottery the mean was $8.67. Langer referred to this phe- 
nomenon as the illusion of control, that choosing one’s ticket improves in some way 
the likelihood one will win. States seem to appreciate this phenomenon and lotteries 
involving choices are very common. 

The prenumbered and instant scratch-off games allow a state to establish winning 
payoffs that exactly conform to any payback percentage. For instance, if the instant 
scratch-off game has $1 tickets and a $100 prize, then a 40% payback can be guaranteed 
by printing 0.4% winning tickets. The numbers game can also involve fixed payoffs. 
For instance, if the game is to pick the three-digit number that is randomly drawn from 
the 1,000 possible three-digit numbers, then a prize of $400 is a 40% payback. The 
difference here is that the state averages a 60% return, but this return is not guaranteed. 
If the winning number has disproportionately many bettors, then the state’s return will 


'3See State Government Finances in 1991 (Washington, DC, Government Printing Office), Table 35. 

14 An exception was the inaugural offering of a new lottery in British Columbia. To create a keen interest in its 
game, participants received six tickets for the price of one, for an expected return of $0.385 times 6 or $2.31, 
a 131% edge. See Ziemba (1995). 
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be less than 60%, and the possibility exists that it could even be negative. Despite this 
difference to the state, the advice to bettors remains: no profitable betting scheme exists 
for lottery games of this sort and each bet’s expected return equals the state’s payback 
percentage. 

For the numbers game, Clotfelter and Cook (1993) document a tendency for the 
public to choose numbers relatively less often immediately after they have been drawn. 
They describe this pattern as a form of the gambler’s fallacy, the belief that if an event 
just occurred, then the likelihood that it will occur again falls.!5 


6.2. Inefficiencies with Unpopular Numbers 


Fixed payoffs for lotteries are not the only possibility. Pari-mutuel payoffs are used 
by all states for lotto games and by Massachusetts for its numbers game. The pari- 
mutuel method allows a state to guarantee its percentage take by having the payoff 
to winners decreasing in the number of winners. Given that all numbers are equally 
likely,!® no system can be developed that will improve the likelihood of winning any 
of the lotteries that have been described. However, if a numbers or lotto game employs 
pari-mutuel payoffs, then by choosing unpopular numbers, upon winning, one is likely 
to share the given prize with fewer other winners. If some numbers are sufficiently 
unpopular, bets with positive expected return may exist, despite the lottery’s low payout 
rate. Chernoff’s (1980, 1981) study of the Massachusetts number game, where players 
pick a number from 0,000 to 9,999, found that numbers with Os, 9s, and to a lesser extent 
8s, tended to be unpopular. He showed that by concentrating on the unpopular numbers, 
bets with a positive expected return were possible. Clotfelter and Cook (1991) provided 
some evidence of this, too, with three days of 1986 data from Maryland’s three-digit 
numbers game. The most popular three-digit choice was 333 which was 9.93 times 
more common than the average. The seven most popular choices were all triples—333, 
777, 555, 444, 888, 666, and 999—and all were at least five times more popular than the 
average number. The least popular was 092, picked 0.23 times as often as the average 
number, and was followed in unpopularity by 086, 887, 884, and 968, all 0.25 times as 
popular as the average. 

Lotto, with its possibility of prizes of tens of millions of dollars, is one of the most 
popular games, and it has received the most media attention. It involves matching six 
numbers drawn without replacement from 50 or so total possible numbers. If T is the 
total possible numbers and D is the number drawn, then the probability of matching is 
1 in T!/(D\(T — D)!). So, for example, the probability of winning when six numbers 


'SMetzger (1985) considered the gambler’s fallacy at the racetrack, and found support for the hypothesis that 
betting on the favorite should be more attractive after a series of longshots have won than after a series of 
wins by favorites. 

16Johnson and Klotz (1993), on the basis of 200 Lotto America winning combinations, suggest that each 
number may not be equally likely. They find that, roughly, smali numbers are drawn more frequently than large 
numbers. They suggest that it may be a consequence of the mechanical mixing process, that small-numbered 
balls are dropped into the urn first. 


Chapter 21: Efficiency of Racing, Sports, and Lottery Betting Markets 495 


William T. Ziemba 215 


are drawn from 49 is 1 in 13,983,816. Most games have prizes for matching fewer than 
all the drawn numbers, too, but it is common for half the prize money to go to the grand 
prize. The long odds mean that none of the perhaps millions of bettors might win in a 
given week (the usual period over which lotto is played). In this event, the grand prize 
jackpot is carried over to the next week. ZBGS studied whether unpopular numbers and 
the carryover can allow a profit. Using several methods, they determined that there were 
unpopular numbers, they were virtually the same ones year to year, and they tended to 
be high numbers (non-birthdays, etc.) and those ending in Os, 9s, and 8s. For instance, 
a regression method based on actual payoffs generated the following as the 12 most 
unpopular numbers: 32, 29, 10, 30, 40, 39, 48, 12, 42, 41, 38, and 18. They were 1,530% 
less popular than average. The most popular number, 7, was selected nearly 50% more 
often than the average number. Using a maximum entropy distribution approach, Stern 
and Cover (1989) identified 20, 30, 38, 39, 40, 41, 42, 46, 48, and 49 as the 10 most 
unpopular numbers while 3, 7, 9, 11, 25, and 27 were the six most popular.!” 

ZBGS showed that expected returns of $1.50 without carryover and up to $2.25 with 
carryover per dollar bet are possible. Does this imply that lotto games can be profitable, 
though? To see that it may not, consider a hypothetical game where you pay $1, choose a 
number between 1 and 1,000,000, and if your number matches the one that is randomly 
selected, then you win $2,000,000. In spite of your edge, you are likely to go bankrupt 
before winning the jackpot. A reduced wager will increase the likelihood that you will 
eventually hit the correspondingly reduced jackpot before you go bankrupt, but your 
expected wealth will suffer. MacLean et al. (1992) analyzed this problem using a model 
contrasting the growth of wealth and the security of wealth and found that lotteries are 
an impractical way for modestly endowed investors to enhance their long-term wealth. 
For instance, by wagering an optimally small amount each round, one’s initial stake 
can be increased tenfold before losing half the stake with a probability close to one. 
However, millions of years of wagering are required, on average. For example, consider 
the hypothesized data in Table 7 and the results in Figure 7. With a more attractive set 
of prizes, the probability is arbitrarily close to one for sufficiently small wagers (see 
MacLean et al., 1992). 

Rather than make optimally small wagers in the face of small probability gambles, 
growth may be improved by increasing the probability of success. For lotteries, this 
can be accomplished by buying more than one combination of numbers. It may even be 
possible in the face of a substantial carryover to profitably purchase most, or perhaps all, 
of the combinations. There have been times when this would have been profitable. In 
practice, though, the transaction costs are enormous because tickets must be purchased 
one at a time. Furthermore, there is the worry that others might also be covering all the 
numbers, to your joint detriment.'® 


"7 See also Joe (1987). Clotfelter and Cook (1991) provided another example of popular numbers from Mary- 
land’s lotto, which has 40 total possible numbers. On the particular day they analyzed, players picked the 
1-2-3-4-5-6 combination over 2,000 times more frequently than the average pick. Had this been the winning 
combination (at a chance of one in 3,838,380), winners would have collected only $193.50! 

18A related opportunity arises with horse racing pick-sixes (pick the winners of six consecutive races) if there 
are substantial carryovers. Covering all pick-six possibilities is easily accomplished at the track and may be 
profitable if few others behave likewise. 
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TABLE 7 Lotto 6/49 Data 


Prizes Probability Value Contribution 
Jackpot 1/13,983,816 $6,000,000 42.9 
Bonus 1/2,330,636 $800,000 34.3 

5/6 1/55,492 $1,000,000 9.0 

4/6 1/1,032 $5,000 14.5 

3/6 1/57 $150 17.6 
Edge 18.1% 
Kelly bet 0.0000001 1 
Number of tickets with $10,000,000 bankroll 11 


Source: MacLean et al. (1992). 
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FIGURE 7 Lotto 6/49—Probability of multiplying before losing half of one’s fortune vs. bet size. 
Source: MacLean et al. (1992). 


Lotto typically involves drawing six numbers. Different states have different total 
possible numbers, though, resulting in very different probabilities of winning. In 1990, 
the extremes were one chance in 974,000 (36 numbers and two picks per ticket) in 
Delaware and one chance in 22,957,480 (53 total numbers and one pick per ticket) in 
California (Cook and Clotfelter, 1993, p. 635). Cook and Clotfelter (1993) explain this 
as a trade-off that states must make between the size of the jackpot and a player’s esti- 
mate of the likelihood that he or she will win. The former is easily learned through 
advertisements and the media. The latter, according to Cook and Clotfelter, is gener- 
ally not well understood but tends to be based on the frequency with which someone 
wins (p. 634). Thus, Delaware could increase its total possible numbers to 53 as in 
California but, given its population, on average there would be many weeks between 
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winners. This would lower the public’s view of the likelihood of winning and the 
attractiveness of purchasing a ticket. On the other hand, given California’s population, 
even with 53 total possible numbers there will usually be a winner each week. This 
nonrational means of probability assessment causes a scale effect whereby per capita 
expenditure increases with the population base of the lottery. Smaller states cannot 
exploit this scale effect themselves but can through forming consortia with other states, 
as happens with the Tri-State lottery (involving Maine, New Hampshire, and Vermont) 
and the states constituting Lotto America. 
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Abstract 


This chapter examines whether the favorite-longshot bias that has been found in 
gambling markets (particularly horse racing) applies to options markets. We investigate 
this for all options on the S&P 500 futures and the FTSE 100 futures for the 17+ years 
from March 1985 to September 2002. Calls on the S&P 500 with both three months and 
one month to expiration display a relationship between probabilities and mean returns 
that are very similar to the favorite bias in horse racing markets. There are slight profits 
from deep in-the-money and at-the-money calls on the S&P 500 futures and increasingly 
greater losses as the call options are out-of-the-money. For three-month and one-month 
calls on the FTSE 100 futures, the favorite bias is not found, but a significant longshot 
bias has existed for the deepest out-of-the-money options. For the put options on both 
markets, and for both three-month and one-month horizons, investors overpay for all put 
options as an expected cost of insurance to protect against downside risk. The patterns 
of mean returns is analogous to the favorite-longshot bias in racing markets. 


JEL Classifications: C15, G13 


Keywords: Longshot bias, gambling, option prices, implied volatilities 


1. INTRODUCTION 


Griffith (1949), McGlothin (1956), Snyder (1978), Ali (1979), and others have doc- 
umented a favorite-longshot bias in racetrack betting.’ High probability-low payoff 
gambles have high expected value and low probability—high payoff gambles have low 
expected value. For example, a 1/10 horse having more than a 90% chance of winning 
has an expected value of about $1.03 per $1 bet, whereas a 100/1 horse has an expected 
value of about 14 ¢. The favorite-longshot bias exists in other gambling markets such as 
sports betting; see Hausch et al. (1994) for a survey of results. 

In Ziemba and Hausch (1986), the expected return per dollar bet versus the odds 
levels are studied for over 300,000 horse races. The North American public underbets 
favorites and overbets longshots. This bias has appeared for many years across all sizes 
of racetrack betting pools. The effect of these biases is that for a given fixed amount 
of money bet, the expected return varies with the odds level; see Figure 1. For bets on 
extreme favorites, there is a positive expected return. For all other bets, the expected 
return is negative. The favorite-longshot bias is monotonic across odds or, equivalently, 
the probability of winning and the drop in expected value is especially large for the 


'While the horse racing favorite-longshot bias is quite stable and pervasive, there exist exceptions in Asian 
racetrack markets (Busche and Hall, 1988; and Busche, 1994). The favorite-longshot bias literature is sur- 
veyed in Hausch et al. (1994, 2008) where many papers are reprinted including the early studies of Griffith 
(1949) and McGtothin (1956). See also the survey of Sauer (1998). Recent papers consistent with the usual 
bias are Hurley and McDouough (1996), Sobel and Raines (2003), and Ottaviani and Sørensen (2003) plus 
the chapters in this volume. 


Chapter 22: The Favorite-Long Shot Bias in S&P500 505 


Robert G. Tompkins, William T. Ziemba, and Stewart D. Hodges 163 


120 F 


Sats CALIFORNIA 


Qe 


40+ 


Effective Track Payback in Percent 
fo) 
oO 
T 


20 f 


Oun EES ane Ps S CEEE el A bo BEA 1 = ee 
DO V ON BARRY KR ANSA AW 
FN VAYS ay aydsoy G2 ANS O 
AAN VOTES VIT O PFF E 


Track Odds (Log Scale) 


FIGURE 1 The effective track payback less breakage for various odds levels in California and New York 
for 300,000 plus races over various years and tracks. 
Source: Ziemba and Hausch (1986). 


lower probability horses. The effect of differing track take/transaction costs is seen in 
the California versus New York graphs. 

Thaler and Ziemba (1988) suggest a number of possible reasons for this bias. 
These include bettors’ overestimation of the chances that longshot bets will win as 
in Kahneman and Tversky (1979). Tversky and Kahneman (1983) argue that bettors 
might overweight small probabilities of winning when the potential payout is large (in 
calculating their utility). Bettors may derive utility simply from the hope associated 
with holding a ticket on a longshot, as it is more fun to pick a longshot to win over a 
favorite and this has more bragging rights. Transaction costs also play a role. Finally, 
they suggest that some bettors may choose horses for irrational reasons, such as the 
name of the horse. Other explanations are that the bias results from the complexity 
of the wagers and the information available to bettors and not from risk-preferences; 
see Sobel and Raines (2003). Ottaviani and Sgrenson (2003), Hurley and McDonough 
(1996), Quandt (1996), and Shin (1991, 1992) provide theoretical models that attempt 
to explain the bias. See also the chapters in this volume, especially Ottaviani and 
Sgrensen (2008). The reasons for the effect varying over time are a combination of 
several factors. These include: (1) the utility bettors gain from betting on longshots and 
the associated preferences over the skewness of returns; (2) the systematic tendency 
for bettors to overestimate the chances of low probability outcomes, and underestimate 
high probability outcomes, and (3) the type and information aspects of informed and 
noise bettors in the race involved. Sobel and Raines (2003) show that the bias is steeper 
for lower quality races compared to higher quality races even on the same day at the 
same track. Consistent with this, Ziemba and Hausch (1987) show that the bias for the 
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Kentucky Derby is much less than is typical. The 1986 graph in Figure | is more flat in 
2007 until the tail dropoff for high odds horses. On this, see, in this volume, the chapters 
by Snowberg and Wolfers (2008) and Ziemba (2008). 

Puts and calls on stock index futures represent leveraged short or long positions on 
the index and their behavior might have similar features to racetrack bets. Demand for 
options comes from both hedging and speculation. The primary use of put options is for 
hedging. 

For the call options, the most obvious hedging demand is to sell them against existing 
holdings of equity. This covered call strategy tends to depress the price of (especially 
out-of-the-money) call options. If this were the sole mechanism for dealing in call 
options, this should result in an increase in the expected return for purchasers of out-of- 
the-money call options. Coval and Shumway (2001) considered the expected and actual 
returns for options on the S&P 500. They find that call options (in a fairly narrow range 
around the current underlying S&P 500 price) have a higher expected return relative to 
the underlying S&P 500 index market. While this result is consistent with a leverage 
effect (the beta of options being much larger than the beta of the S&P 500), the return 
remains less than what it should be if leverage were the sole factor. Coval and Shumway 
(2001) do not consider the investment in deep out-of-the-money call options on the S&P 
500 as we do here.” 

We find significant expected losses for such deep out of the money call options. This 
could be due to speculative activity similar to that for longshot horse race bets. However, 
Bollen and Whaley (2003) showed that buyer-initiated trading in index puts dominates 
the market. Because there are few natural counter-parties to these trades (apart from 
hedge funds), the implied volatilities of these options rise and the implied volatilities 
of the corresponding call options rise due to put-call parity. However, they show that 
the primary choice of buyer-initiated index put trading occurs for the nearest out-of- 
the-money put options. They also stated, “since portfolio insurers generally buy OTM 
puts rather than ITM puts,” this implies that relatively speaking the demand for in-the- 
money puts is less and given that they argue that option mispricing is due to supply and 
demand imbalances at different strike prices, then in-the-money puts would be relatively 
less expensive. By put-call parity, this implies that the costs of the out-of-the-money call 
options would be relatively less expensive and offer a higher return. Nevertheless, our 
results indicate that deep out-of-the-money call options are overpriced. 

Rubinstein (1994) pointed out that the implied volatilities for options on the S&P 
500 changed after the 1987 stock market crash with the prices of out-of-the-money 
put options rising and the prices of out-of-the-money call options falling (relative to 
the price of the at-the-money option). This implied volatility skew (or smile) effect 
has been an active area of research. Buraschi and Jackwerth (2001, p. 523) con- 
clude, “returns on out-from-the-money options are driven by different economic factors 


*During January 1990 to October 1995, which was the period of the Coval and Shumway (2001) analysis, 
the average underlying S&P 500 futures price was approximately 430. They examined puts with strikes 15 
points below and calls with strikes 10 points above. This implies an average percentage difference of strike 
prices that were 3.5% below the current price for puts and 2.3% higher for calls. In this study, we examined 
all traded options with ranges +43.3% of the current S&P 500 price. 
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than those relevant for at-the-money options.” We consider the returns for such deep 
away-from-the-money put options (as opposed to the near-the-money options consid- 
ered by Coval and Shumway, 2001). 

With market imperfections (such as transaction costs or other frictions that disallow 
riskless hedges to be constructed in continuous time) or incomplete markets, option 
prices are no longer uniquely determined by arbitrage, and may be determined (within 
limits) by supply and demand. Dumas et al. (1998) suggest that the behavior of market 
participants may be the reason for the existence of smiles. They state: “with institu- 
tional buying pressures for out-of-the-money puts and no naturally offsetting selling 
pressure, index put prices rise to a level where market makers are eventually willing 
to step in and accept the bet that the index level will not fall below the exercise price 
before the option’s expiration (i.e., they sell naked puts) . . . option series clienteles may 
induce patterns in implied volatilities, with these patterns implying little in terms of the 
distributional properties of the underlying index” (p. 21). 

Figlewski (1989) suggests that volatility smiles exist because of the demands of 
option users. He suggests that the higher prices (and resulting higher implied volatil- 
ities) associated with out-of-the-money options exist because people simply like the 
combination of a large potential payoff and limited risk. He likens out-of-the-money 
options to lottery tickets with prices such that they embody an expected loss. Never- 
theless, this does not dissuade some from purchasing them.? This would suggest that 
investors might be acting irrationally. Poteshman and Serbin (2002) show that this is 
the case for the exercise of exchange-traded stock options. They conclude that the early 
exercise of American calls on stocks during the period of 1996-1999 was in many 
instances “clearly irrational without invoking any model or market equilibrium.” If 
investors act irrationally in this regard, it is also possible they also act irrationally when 
assessing the value of the option and could display similar irrational behavior to other 
speculative endeavors such as gambling. 

We examine the returns from investing in call and put options on stock index futures 
markets and assess whether the mean returns are biased for high leverage situations, 
as they are in various betting markets. To test the hypothesis that options display such 
biases requires a sufficient number of independent observations in actively traded mar- 
kets and a broad enough range of strike prices where such low probability options are 
quoted. We use stock index futures options data, as these markets have existed for a 
sufficiently long period of time to yield enough independent exercise cycles and the 
range of offered strike prices allow the entire probability spectrum to be spanned. These 
instruments may be dominated by institutional investors buying portfolio insurance (as 
suggested by Bollen and Whaley, 2003). Given that such speculative behavior may be 
more likely in option markets with more retail activity, it could be helpful to exam- 
ine individual stock option markets in parallel. However, either such individual stock 
option markets may not have been as actively (and consistently) traded as the stock 


3The purchase of overpriced out-of-the-money puts may be justified by the desire of investors to provide a 
smoother risk profile in volatile markets. 
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index options, yield sufficient independent expiration cycles,’ and often do not offer 
a sufficiently wide range of strike prices to examine extreme probability events, we 
exclude their analysis here. In any case, Bakshi et al. (2003) suggest that the exclusion 
of individual stock options in our study will not seriously affect our general conclu- 
sions. They consider the skew effect for individual equity options written on the S&P 
100 (OEX) cash index and its 30 largest equity components. Using 1991-1995 data, 
they show that the risk-neutral index skews exist and are a consequence of risk aversion 
and fat tailed distributions. Given that they report that individual stock skews are flatter 
than the index skews, this suggests that the option pricing bias is not greater in mar- 
kets with more retail customers (i.e., individual stock options) but is in fact less. These 
equity markets have less systematic risk premium than the index markets and that is 
reflected in the less steep skew. 

As equity index option markets have a wider range of available strike prices and 
trade on a monthly expiration cycle, yielding more independent trials than for stock 
options, we restrict our analysis solely to these markets. We examine two markets that 
have slightly different levels of retail trading activity. Our first is the S&P 500 futures 
options market. According to the Marketing Department of the Chicago Mercantile 
Exchange (and from Large Position reports from the Commodity Futures Trading Com- 
mission) virtually all trading activity for options on the S&P 500 futures comes from 
institutional traders. For options on the Financial Times Stock Exchange (FTSE) 100 
futures traded at the London International Financial Futures Exchange (LIFFE) there 
is more retail involvement. Press releases from LIFFE report that retail involvement in 
these options comprise up to 10% of the total volume (similar to that of the individual 
stock options traded on the LIFFE). This market will provide some insights into the 
impacts of non-professional trading on the favorite-longshot bias. 

Section 2 presents data sources and the methodology for the transformation of option 
prices into odds, so that the results can be compared to the horse racing literature. 
Section 3 presents results for the S&P 500 and FTSE 100 options markets. Section 4 
concludes. 


2. METHODOLOGY 


To investigate whether a favorite-longshot bias exists in option markets requires a trans- 
formation of option prices into odds. In the Black Scholes (1973) equation, N (d2) is 
the forward price of a digital option that pays $1 if F > X. It is the (risk neutral) odds 
at which investors can bet on this event. For a put option, the digital that pays $1 if 
X < Fis N(—dz). As with the racing studies, one must collect a large sample of inde- 
pendent events, determine the odds of certain events occuring, invest a fixed amount in 
each bet (say $1), and examine the a posteriori payoff of that bet. A pool of bets with 
the same odds must be aggregated and the mean payoff returns calculated. Our data 


‘Individual Stock Options are only offered on a quarterly cycle. Given that Stock Index options have had 
monthly expirations since 1987, there are more independent observations to test the hypotheses. 
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is the publicly available settlement prices for the futures contracts and all call and put 
options on the S&P 500 and FTSE 100 index markets on those dates when the options 
had either exactly one month or three months to expiration. The period of analysis 
was the 17.5 years from March 1985 to September 2002 and yielded 69 indepen- 
dent quarterly observations for the S&P 500 and FTSE 100 futures. S&P 500 futures 
began trading on April 21, 1982 and puts and calls on the S&P 500 began trading on 
January 28, 1983. The early years had little volume and few strike prices. Hence, 
our dataset covers the vast bulk of options trading in the S&P 500. For the monthly 
observations (serial options), there were 187 independent observations for the S&P 
500 futures and 124 observations for the FTSE 100 index options markets. The data 
were obtained from the Chicago Mercantile Exchange for the S&P 500 futures and 
options. These option contracts are American style options on futures. The data for the 
FTSE 100 futures and options were obtained from the LIFFE for the European style 
options on futures from 1992 to 2002 and from Gordon Gemmill for the American style 
options on futures prior to 1992. The interest rate inputs were obtained from the British 
Bankers Association (U.S. Dollar or British Pound LIBOR). 

Monthly and quarterly data were used instead of daily data to ensure independence of 
the observations and final outcomes. We identified all expiration dates for all available 
options over the sample period. On that day, we recorded the settlement levels of the 
futures contract (the nearest to the expiration of the futures contract and possibly the 
cash index if that date was a simultaneous expiration of the futures and options contract), 
and all available option prices on this nearby futures contract that had either one month 
or three months to expiration. 

Given that settlement prices were used, it was not necessary to conduct the standard 
filtering procedures such as butterfly arbitrages; see Jackwerth and Rubinstein (1996). 
However, we did remove all options with prices below 0.05 (as for a trade to take place 
the offer price must be at least 0.05). With 17 years of quarterly data, we had 69 quar- 
terly observations in our analysis with an average of 39.1 available strike prices per 
observation for the options on the S&P 500 and 30.8 strikes for options on the FTSE 
100. For the monthly expirations, the average number of strike prices available for the 
S&P 500 options was 39.0 and 28.6 for the FTSE 100. 

The first step is to calculate a measure of the odds of options finishing in the 
money (analogous to the odds in horse racing). Since the options are American, the 
Barone-Adesi and Whaley (1987) approximation has been used to recover the implied 
volatilities, which have then been substituted into the Black (1976) formula to calculate 
the pseudo-European option probabilities [N(d2) and N(—d>)]. For the European style 


>The pit committee of the CME determines the settlement prices rather than by market transactions and this 
could impact our results (especially for OTM options). However, the actual price at the end of the trading 
day could be a bid, mid, or offer price. Given that our analysis considers the payoffs from purchasing options, 
if the actual price that could be dealt at was the bid or mid price (rather than the offer price we implicitly 
assume), the payoffs of the options would be reduced accordingly. Therefore, our estimates of the wealth 
relatives for buying OTM options are more likely to be overly optimistic. 

®To examine the impact of the 1987 crash, we also analyzed the post-crash period of our dataset. The results 
were not materially different (apart from small reductions in the mean wealth relatives of out-of-the-money 
put purchases). 
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options on the FTSE 100, the Black (1976) implied volatilities were directly used.” In 
all markets, the implied volatilities for each option were used to calculate the odds. To 
make a more consistent comparison with horse race betting, the premium for the options 
were expressed in forward value terms. Using Black’s (1976) formula 


Cry = FN(d,) — XN(d2) (la) 
Py, = XN(—d,) — FN(—a2). (1b) 
where 
qe In($) + 40° (T-t) 
ovr 
and 


d,=d, -ov(T =t). 


As we only observe the current option prices C,, and P,,, we transform these to the 

results in Equations (1a) and (1b) by multiplying the observed prices by e”7-® (where 

r is the LIBOR interpolated between adjacent standard maturities as reported by the 

British Bankers Assocation on the observation date, f, and T is the expiration date). 
The terminal payoffs of the options are 


Cr = MAX (Fr — X,0) and Pr = MAX (X — Fr,0), (2) 


respectively. We calculate the wealth relatives of the ratios of these to the initial option 
forward values: in the absence of risk premiums these would be expected to average 
to one. 

An important issue in averaging them is how the wealth relative on each option 
should be weighted. In our data sample, the number of strikes available increases with 
time. We would therefore lose efficiency if we weighted all options equally, as this 
would correspond to investing increasing amounts over time, where, for a given day the 
returns on options at different strikes are not independent. Our first principle is therefore 
to weight each monthly or quarterly period equally, by investing a fixed amount of 
money (e.g., $1) at each date. 

To achieve the same investment amount for the alternative option contracts, the 
number of options purchased equals 


Qc = $1/Cy, and Qp = $1/ Pry, (3) 


respectively, for all calls and puts. Equation (3) suggests that for higher priced options 
(e.g., in-the-money), the quantity purchased will be small and for lower priced options 


TTo be strictly comparable with horse racing odds, we should calculate the cost of a digital option under the 
distribution implied from option prices. However, it is not clear which of the variety of parametric and non- 
parametric approaches for the determination of implied distributions would be most appropriate. In any case, 
this would introduce another possible source of error. 
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(e.g., out-of-the-money), the number of options purchased will be large. We interprete 
the in-the-money options as the favorites and the out-of-the-money options as the 
longshots. 

Unlike horse racing, the (risk neutral) probabilities of payoff in the options markets 
are not expressed as odds but in a continuous probability range from 0% to 100% (and at 
random points). In horse racing, while the bets are expressed as odds, such bets actually 
represent a continous probability range for all bets between discrete categories (and are 
rounded down). As examples, 9/5 bets cover all ranges from 1.80 to 1.99 to 1 and 5/2 
bets covers all bets from 2.00 to 2.49 to 1. 

To determine expected wealth relative at fixed “odds” levels [N(d2) or N(—d2)], 
we use interpolation to estimate what strike and option price would apply. Within the 
range of “odds” that exist on a given day, we linearly interpolate the implied volatility 
between adjacent strikes. With each wealth relative estimated thus, we form a sim- 
ple average of wealth relatives from non-overlapping periods, and can therefore easily 
perform significance tests.” 

Standard significance tests (such as a one-tailed t-test) may be inadequate when the 
sample distribution is not normal. The holding period return distributions of options 
tend to be quite positively skewed, and particularly so for out-of-the-money options, and 
when a risk premium on the underlying increases (for calls) or reduces (for puts) the 
(objective) probability of exercise. Care is therefore needed in testing the significance 
of the mean wealth relative to any given null hypothesis. To address this, we conducted 
Monte Carlo simulations to obtain the distribution of the realized mean wealth relatives 
for samples of suitable sizes (60 for quarterly and 160 for monthly horizons).'° These 
simulations were done under Black Scholes assumptions, with and without a risk pre- 
mium, and for one-month and three-month times to expiration. The confidence intervals 
obtained in this way were noticably different from the t-test intervals that would have 
been applicable for a normal (or nearly normal) distribution. !! 


8We only interpolate and do not extrapolate beyond the range of traded strikes. 

°We compute mean expected payoffs for various odds (probability of finishing in the money) bets to be 
comparable to the racetrack literature. One could beta risk adjust these bets to possibly separate out risk from 
behavorial biases. Coval and Shumway (2001) show how to do this. They find that there are then negative 
expected returns from buying puts and calls. This is consistent with our story that long run expected profits 
accrue to option sellers rather than option buyers. However, the Shumway and Coval results show that if 
you risk adjust the expected return from buying in-the-money 3 month calls is most likely negative and not 
positive as shown in Figure 3. 

10The Monte Carlo simulation entailed simulating 10,000 times the average wealth relative for 60 (quarterly) 
and 160 (monthly) payoffs for call and put options with N(d2) from 0.05 to 0.95 in 0.05 increments. The 
standard error of the wealth relative was determined and the appropriate confidence levels were determined. 
For the inclusion of the risk premium, the negative continuous dividend adjustment to the Merton (1973) 
model proposed in the following section was used to determine the ratio of the expected wealth relative 
compared to the Black Scholes (1973) option price. 

1I Nevertheless, because the confidence intervals most affected are for the right hand tails of out-of-the-money 
options (which tended not to be observed empirically) the use of the simulated intervals makes little difference 
to our results. 
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3. RESULTS 


The first step is to examine what the payoffs of call and put options would be under the 
Black Scholes (1973) model. Although the presence of a risk premium on the equity 
index does not affect the option valuation, it will affect the pattern of realized wealth 
relatives. When risk premiums exist (e.g., in equity markets, see Constantinides, 2002), 
the expected return for the investment in options will differ from the $1 investment. 
Similar in spirit to Coval and Shumway (2001), we examined the expected theoretical 
returns for call and put options using the Black Scholes (1973) formula with no risk 
premiums and risk premiums of 2%, 4%, and 6%. This is done by using —2%, —4%, and 
—6%, respectively, as the continuous dividend rate, using the Merton (1973) dividend 
adjustment, in the Black Scholes formula. The ratios of the option prices are determined 
and plotted as a function of money. This can be seen in Figure 2 for call and put options. 
The calls lie above the $1 investment and the puts lie below the $1 investment. 
Consistent with the theoretical results of Coval and Shumway (2001), who show that 
in a very general setting, call options written on securities with expected returns above 
the risk-free rate should earn expected returns that exceed those of the underlying secu- 
rity and put options should earn expected returns below that of the underlying security. 
They also show that under very general conditions, these divergent expected returns 
would be increasing with the strike price (degree of out-of-the-moneyness). With this 
guidance as to how we would expect option returns to behave as a function of the 
Black Scholes (1973) model with risk premiums, we can now assess the returns actu- 
ally observed for options on the S&P 500 and FTSE 100 futures. The results appear in 
Tables 1 and 2 for three-month options. The call and puts options appear on the left-hand 
and right-hand sides, respectively. For both, the first column is the odds of finishing in 
the money as measured by N(d2) or N(—d2). The next column indicates the number 


Expected Wealth Relatives: Calls and Puts 


ree 2% Risk Premium 
— 4% Risk Premium 
—*— 6% Risk Premium 


Np) 
N(~ ab) 
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FIGURE 2 Expected wealth relatives for call and put options with alternative risk premium levels. 
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TABLE 1 Mean Return per $1 Bet vs. Odds Levels: Three-Month Options on S&P 500 Futures, 1985-2002 


Call options on the S&P 500 futures 


Put options on the S&P 500 futures 


Average SDof 

Odds (%)  #Obs payoff payoff t-test vs. $1 Odds (%) 
.95-1.00 47 1.0010 0.3204 0.02 -95-1.00 
90-95 60 1.0561 0.4605 0.95 -90-.95 
.85-.90 66 1.1231 0.5704 1.76°* .85-.90 
.80-.85 67 1.1407 0.6990 1.66** .80-.85 
.75—.80 63 1.0938 0.5953 1.25 .75—.80 
10-75 64 1.1366 0.7732 1.41* -70-.75 
.65-.70 62 1.1461 0.8648 1.33* .65—.70 
.60-.65 59 1.1311 0.9972 1.01 60-.65 
55-60 58 1.1727 1.1154 1.18 .55-—.60 
.50-.55 54 0.9890 1.0410 -0.08 .50-.55 
.45-.50 56 1.1365 1.3925 0.73 .45-.50 
40-45 58 1.2063 1.6012 0.98 .40-.45 
.35—.40 51 0.9770 1.7015 -0.10 .35-.40 
30-35 54 0.9559 1.6041 -0.20 .30-.35 
.25—.30 59 1.2923 2.7539 0.81 .25-.30 
.20~.25 53 1.1261 2.5378 0.36 .20-.25 
15-20 55 0.8651 2.0742 -0.48 .15-.20 
10-15 56 1,2262 3.6982 0.46 10-15 
.05-.10 53 1.5085 5.3370 0.69 .05-.10 
.00-.05 39 0.0123 0.1345 -44,.89"*** .00-.05 
All All 
options 69 1.1935 2.4124 0.67 options 


#Obs 


37 
44 
50 
54 
53 
51 
53 
54 
50 
56 
51 
56 
56 
62 
64 
65 
64 
66 
66 
57 


69 


Average 
payoff 
0.8998 
0.8662 
0.8426 
0.7937 
0.8137 
0.7879 
0.7702 
0.6215 
0.8225 
0.5807 
0.7344 
0.6785 
0.4744 
0.6257 
0.6316 
0.6426 
0.6696 
0.6602 
0.6432 
0.7525 


0.6212 


SD of 
payoff 
0.4493 
0.5872 
0.7265 
0.8120 
0.8950 
0.9979 
0.9648 
1.0258 
1.2458 
1.1377 
1.4487 
1.5367 
1.2383 
1.6791 
1.8231 
1.9854 
2.2441 
2.6359 
3.4256 
5.6025 


2.5247 


t-test vs. $1 


-1.35* 
-1.50° 
-1,53* 
-1.86** 
-1.51* 
-1.51* 
-1.73* 
-2.70**** 
-1.01 
~2.76"*** 
-1.31* 
-1.57* 
-3,19**** 
-1.76** 
-1.62* 
~1.45" 
-1.18 
-1.05 
-0.85 
0.33 


-1.25 


of observations we have for that particular 5% band (i.e., days for which $1 could be 
invested). The average payoff for a $1 investment in that particular option band appears 
next and is followed by the standard deviation of the option payoffs within the band. 
The final column is a modified one tailed t-test of the hypothesis that the mean return is 
equal to the initial investment of $1 using 


where 


t= (x: = $1) / (sii) 


n 
X=} Xy nand X;; 
j=l 


(4) 


(5) 
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Xij is the wealth relative of the jth option in the ith continuous “odds” range, and À is 
the equity risk premium. Critical levels for the t-test were determined using a Monte 
Carlo simulation. When the hypothesis is rejected at a 90% level or above, the t-statistic 
appears in bold print, “*”, “**,““***", or “**** on the t-statistic indicates that the 
level of significance is greater than the 90%, 95%, 97.5%, or 99% level, respectively. ! 

Figures 3 and 4 provide a graphical view of the mean returns, related to in-the- 
moneyness, in these markets. These are not plots of the data in Tables 1 and 2, but were 
calculated by our continuous interpolation method. This was done solely to allow 


TABLE 2 Mean Return per $1 Bet vs. Odds Levels: Three-Month Options on FTSE Futures, 1985-2002 


Call options on the FTSE futures Put options on the FTSE futures 
Average SDof t-test Average SD of t-test 
Odds (%)  #Obs payoff payoff vs. $1 Odds (%)  #Obs payoff payoff vs. $1 
.95-1.00 32 1.0294 0.3215 0.52 .95-1.00 29 1.0019 0.5058 0.02 
.90-.95 38 1.0485 0.4830 0.62 .90-.95 38 0.8995 0.6101 1.02 
.85-.90 41 1.1025 0.5901 1.11 .85-.90 36 0.8564 0.7274 = 1.19 
80-.85 43 1.1033 0.7033 0.97 -80-.85 37 0.9628 0.8862  —0.25 
.75-.80 44 0.9531 0.6601 —0.47 .75-.80 40 0.9709 0.9221  —0.20 
.70-.75 49 0.9473 0.7491  —0.49 .70-.75 37 0.9201 1.0829 -0.45 
.65-.70 47 1.1151 .0764 0.73 .65-.70 40 1.0430 1.1861 0.23 
.60-.65 49 0.8999 0.7903 -0.89 .60-.65 43 0.8264 1.1006 —1.03 
.55-.60 44 1.1142 .1296 0.67 .55-.60 38 0.9276 1.3428  —0.33 
.50-.55 45 0.9505 1.2324 —-0.27 .50-.55 39 0.8525 1.3050 -0.71 
.45-.50 44 1.0148 .1783 0.08 .45-.50 48 0.8615 1.5273  —0.63 
.40-.45 41 0.8594 1.1062 ~0.81 .40-.45 43 0.8764 1.7370  —0.47 
.35-.40 43 1.1381 8821 0.48 35-40 48 0.7311 1.4967 -1.25 
30-.35 43 0.6177 1.1931  —2,10"** 30-35 44 1.0169 2.2145 0.05 
.25-.30 47 1.0396 2.1356 0.13 .25-.30 53 0.7216 2.2611 —0.90 
.20-.25 38 0.8813 1.9081  —0.38 .20-.25 49 0.6252 1.9079  —1.37 
.15-.20 0 0.4773 1.3779 -2.40*** -15-.20 48 1.0081 3.3628 0.02 
10-15 42 0.9025 2.6841 —0.24 10-15 46 0.4131 1.9507 —2.04""" 
05-10 37 0.1421 0.7891 -—6.60"*** 05-.10 44 0.3600 2.2526 -1.88** 
.00-.05 35 0.1877 1.1102 —4,.32"""" .00-.05 38 0.0893 1.0420 —5.390"" 
All All 
options 70 0.9983 1.4668 —0.01 options 70 0.6016 1.6203 —2.05*** 


12The confidence intervals used were based on a risk premium of 1.75% per quarter, which was the average 
realized risk premium for the two stock index markets over the period of our analysis. Thus, à in Equation 
(4) is 1.0175 for quarterly options and 1.00583 for monthly options. 
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FIGURE3 Mean return per dollar bet vs. odds levels: three-month stock index calls, 1985-2002. 
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FIGURE 4 Average return per dollar bet vs. odds levels: five-month stock index puts, 1985-2002. 
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continuous, smooth curves to be drawn (and does not alter our interpretation of the 
results for the ranges in Tables 1 and 2). 


3.1. Results for Quarterly Options on Stock Index Futures 


For the call options on the S&P 500 futures, we find a similar favorite-longshot bias as 
in horse racing. The deep in-the-money call options in the probability ranges of 65% to 
90% have a mean return of $1.058. For the remaining ranges from 5% to 65%, we can- 
not reject the hypothesis that the return is significantly different from the $1 investment. 
For the deepest out-of-the-money calls (0—5%) the mean returns are only 1.23 cents 
per dollar invested. We reject the hypothesis of an expected return of $1 for the lowest 
5% at a 99% level or above. This result supports the hypothesis of Figlewski (1989) 
that out-of-the-money call options are seen by investors is lottery tickets and investors 
overpay for deep out-of-the-money call options on the S&P 500 futures. Thus, the liter- 
ature on “excessive optimism” in the assessment of risky situations may apply here; see 
Kahneman and Tversky (1979) and Tversky and Kahneman (1983). 

For the call options on the FTSE 100 futures, for call options in the range between 
80% to 100% probabilities, there is no significant difference between the expected pay- 
off and the initial $1 investment. Likewise, for most of the range from 35% to 80%, we 
cannot reject the hypothesis that the return is significantly different from the $1 invest- 
ment. However, for most of the out-of-the-money calls with probabilities less than 35%, 
we reject the hypothesis of an expected return of $1 at above a 99% confidence level. 

The put options on both the FTSE 100 and S&P 500 futures (essentially) all have 
negative mean returns. Moreover, the mean payoff is decreasing as the probabilities 
decrease, analogous to the horse racing favorite-longshot bias. This is also consistent 
with the contentions of Rubinstein and Jackwerth (1996), Dumas et al. (1998), and 
Bollen and Whaley (2002) that investors view put options as insurance policies and are 
willing to accept an expected loss to protect their holdings of equity against downside 
risk losses. To provide a clearer comparison between our results and those of Ziemba 
and Hausch (1986), the figures use similar axes: probabilities equal the reciprocal of the 
odds plus one. This can be seen for sets of stock index options in Figures 3 and 4. 

This allows direct comparison to Figure 2, that presents the theoretical relationship 
between an option’s expected returns and risk premiums. If risk premium was causing 
call options returns to return more than the $1 investment, we would expect Figure 3 
to resemble the upper portion of Figure 2. When the returns are expressed as wealth 
relatives, out-of-the-money options offer a lower rate of return—exactly the opposite 
of what we expect. Therefore we conclude that the mechanism at work is not the risk 
premium argument of Coval and Shumay (2001) but a favorite-longshot bias. 

In Figure 3, in-the-money call options yield more than the $1 invested in each option. 
This is not surprising, given the existence of a risk premium for the equity market.'? 
However, the overall pattern is surprising: we would expect all calls to offer a higher 
rate of return, and for this to increase as the odds lengthen, as in Figure 2. For put options 
on the stock index futures in Figure 4, the mean return tends to decrease, as the option 


13n the probability ranges from 45% to 55%, our results are similar to those of Coval and Shumway (2001). 
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is further out of the money. This is more consistent with Figure 2, but still suggests 
some anomalous behavior. For the S&P 500 put option returns, it is the in-the-money 
ones that are significant, whereas for the FTSE 100 options, only the out-of-the-money 
options are significant. In all cases, the returns on the longshot options are much more 
variable than on the favorites. Thus a much larger deviation of the sample mean from 
one is required, for a given number of observations, in order to reject the hypothesis. 


3.2. Results for Monthly Options on Stock Index Futures 


An enlargment of the data for the index options occurs when one considers options on 
futures with monthly expirations. This also allows a comparison with the three-month 
terms to expiration discussed above. The results appear in Tables 3 and 4 for the one- 
month calls and puts for the S&P 500 futures and FTSE 100 futures, respectively. 


TABLE 3 Mean Return per $1 Bet vs. Odds Levels: One-Month Options on S&P 500 Futures, 1985-2002 


Call options on the S&P 500 futures 


Put options on the S&P 500 futures 


Odds (%) 


.95-1.00 
.90-.95 
.85-.90 
.80-.85 
.75-.80 
.70-.75 
.65-.70 
.60-.65 
.55-.60 
.50-.55 
45-50 
40-45 
35.40 
30-.35 
.25~.30 
20-25 
15-.20 
.10-.15 
.05-.10 
.00-.05 


All 
options 


#Obs 


187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 


{87 


Average 


payoff 


1.0092 
0.9938 
1.0029 
0.9796 
1.0064 
0.9346 
0.9693 
0.9656 
0.9196 
0.9586 
0.8954 
0.9204 
0.9671 
0.8673 
0.9927 
0.7939 
0.9257 
0.7585 
0.6940 
0.50958 


0.9668 


SDof t-test 
payoff vs. $1 Odds (%) 
0.2506 0.50 -95-1.00 
0.3923 —-0.22 90 —.95 
0.4877 0.08 .85-.90 
0.5925 -0.47 .80-.85 
0.6762 0.13 .75—.80 
0.7612 -1.18 -70-.75 
0.8699  —0.48 65-.70 
0.9497  —0.50 .60—.65 
1.0671  —1.03 .55-.60 
1.1004  ~0.51 50-55 
1.2820 ~1.12 45-.50 
1.3652 ~0.80 40~.45 
1.5108 ~0.30 -35-.40, 
1.6712 ~1.09 -30-.35 
1.8245 ~0.05 .25—.30 
1.8764 —1.50* .20-.25 
2.5402 —0.40 .15-.20 
2.8601 —1.15 10-15 
3.5704 —1.17 05-10 
3.9119  ~-1.71*" .00-.05 
All 
2.1085 0.22 options 


#Obs 


187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 


187 


Average 


payoff 


0.9792 
0.9883 
0.9989 
0.9544 
0.9880 
0.9437 
0.9520 
0.9193 
0.8867 
0.9217 
0.8146 
0.9064 
0.7672 
0.7987 
0.7454 
0.6910 
0.6101 
0.5303 
0.4039 
0.0508 


0.5033 


SD of 
payoff 
0.4949 
0.6677 
0.7746 
0.8778 
0.9814 
0.9879 
1.0734 
1.2257 
1.2654 
1.4020 
1.4427 
1.6557 
1.6412 
1.9158 
2.0390 
2.1639 
2.3346 
2.4037 
2.3630 
0.9785 


1.3827 


t-test 

vs. $1 
—1.87** 
—2.01*"" 
—1.18 
—1.67* 
-0.61 
-1.54 
—0.74 
-0.97 
—1.48* 
—0.72 
~—1.54* 
-1.17 
—2,53*** 
-1.64* 
-2,38"** 
-2.55 
-3.287 * 
—3.11**** 
—3.52"*** 
-14.49"" 


-492"""" 


518 Calendar Anomalies and Arbitrage 
176 Chapter 9 « Favorite-Longshot Bias in S&P 500 and FTSE 100 
TABLE 4 Mean Return per $1 Bet vs. Odds Levels: One-Month Options on FTSE Futures, 1985-2002 
Call options on the FTSE futures Put options on the FTSE futures 
Average SDof t-test Average SDof t-test 

Odds (%) #Obs payoff payoff vs. $1 Odds (%)  #Obs payoff payoff vs. $1 
.95-1.00 123 0.9595 0.2694 —1.67* .95-1.00 123 1,0105 0.4717 0.25 
.90-.95 123 0.9719 04011 -0.78 .90-.95 123 0.9847 0.6005 —-0.28 
.85-.90 123 0.9596 0.5020 -0.89 -85-.90 123 1.0229 0.6874 0.37 
-80-.85 123 0.9474 0.6020 —-0.97 -80-.85 123 0.9235 0.7736 = -1.10 
-75-.80 123 0.9761 0.6480 —0.41 .75-.80 123 0.9760 0.9099 -0.29 
70-75 123 0.8576 0.7525 -2.10** 10-.75 123 1.0093 1.0292 0.10 
65-.70 123 0.9296 0.8458 —0.92 65-.70 123 0.9501 1.0715  —0.52 
.60-.65 123 0.8632 0.8191 = —1.85** .60-.65 123 0.8984 1.1686 -0.96 
55-60 123 0.8866 1.0456 —1.20 .55-.60 123 0.9579 1.1831 = -0.39 
50-.55 123 0.8295 0.9372 —2.02** 50-55 123 0.8033 1.2349 -1.77"* 
.45-.50 123 0.9129 1.2141 -0.80 .45-.50 123 0.8161 1.4092 -1.45* 
.40-.45 123 0.7647 1.2268  —2.13** .40-.45 123 0.9409 1.5550 —0.42 
.35-.40 123 0.7588 1.1234 —2,38*" 35-40 123 0.8699 1.6963 -0.85 
.30-.35 123 0.8685 1.6097 —-0.91 .30-.35 123 0.7072 1.7646 = —-1.84** 
.25-.30 123 0.4707  LIH9  —5.28"** .25-.30 123 0.8041 2.0297 -1.07 
.20-.25 123 0.7006 2.0045 —1.66** .20-.25 123 0.5855 2.0360 —2.26** 
.15-.20 123 0.4952 1.4297 -3,92*** .15-.20 123 0.5423 2.4428 —-2.08** 
10-.15 123 0.4779 2.4364 _ —2.38*** 10-15 123 0.5878 2.8156 -1.62* 
.05-.10 123 0.4920 3.6893 —1.53* .05-—.10 123 0.4872 3.3026  —1.72** 
.00-.05 123 0.3427 4.8288 —1.51* .00-.05 123 0,2968 3.4337 -2.27™ 
All All 
options 2.460 0.7926 2.0670 —4,98*** options 2.460 0.6535 2.4630 -6.98*** 


For both the S&P 500 and FTSE 100 option markets, the deep in-the-money one- 
month calls have a mean wealth relative close to one. The further the options are out of 
the money, the lower the mean payoff, as shown in Figures 5 and 6 using our interpola- 
tion method to give returns for odds spaced at every 1%. The pattern is quite striking for 
both markets: the payoff decays monotonically and is similar to the racetrack longshot 
bias shown in Figure 1. However, the only cases of mean returns significantly below 
one are for the FTSE 100 options, for which there is more retail activity. We have the 
usual problem of the large measurement error in expected returns measured with limited 
observations over short horizons. 

For all the put options, the pattern of mean returns for the one-month puts is 
extremely close to those found for the three-month put options.The deepest in-the- 
money puts pay on average the initial bet. Losses increase as the puts are further out 
of the money, displaying a similar longshot bias to Figure 1. 
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FIGURE 5 Mean return per dollar bet vs. odds levels: one-month stock index calls, 1985-2002. 
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FIGURE 6 Mean return per dollar bet vs. odds levels: puts on stock index futures, 1985-2002. 
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Figures 5 and 6 show the mean return for one-month options on the S&P 500 and 
FTSE 100 futures across continuous probability bandwidths. In Figure 5, S&P 500 call 
options return the $1 invested in each option on average. For the FTSE 100, the options 
tend to return slightly less than the initial investment. Therefore, there is no evidence 
of a favorite bias as the expected wealth relative is either equal to the initial investment 
or is statistically significantly below the initial investment. However, for both markets a 
significant longshot bias exists for the out-of-the-money calls in the probability ranges 
from 0% to 15% (the return is significantly less than the initial investment at a 90% 
level). However, the degree of the loss for the deep out-of-the-money calls in the 0% to 
15% range is smaller than for the three-month options seen in Tables 1 and 2. This is 
not surprising as the expected losses occur at an almost steady rate over time, and we 
have only a third of the previous time to expiration. 

The one-month put option returns appear in Figure 6. As with Figure 4 for the three- 
month put options, the mean return tends to decrease, as the option is further out of the 
money. For these options, the shape of the average return function is smoother than 
the three-month pattern. One possible explanation for this comes from Bollen and 
Whaley (2002). They indicate that the greatest concentration of trading in stock index 
put options is for put options with one month or less to expiration. Therefore, with more 
actively traded put options across the entire maturity spectrum, there is less need to 
interpolate. 


4. CONCLUSIONS 


The motivation for this research was to assess whether the favorite-longshot bias that 
has been found in horse racing and other gambling markets applies to options markets. 
The choice of stock index options was made due to a previous conjecture by Figlewski 
(1989) that deep OTM stock index call options are seen by investors as the equivalent of 
low cost/high payoff gambles and Dumas et al. (1998) that stock index put options are 
purchased at higher prices due to the need for insurance. We investigated the favorite- 
longshot bias for options on the S&P 500 Index Futures and FTSE 100 Index Futures 
for the 17+ years, March 1985 to September 2002. 

The deep OTM index call options on the S&P 500 futures and FTSE 100 futures 
have negative mean returns. During the period of 1985-2002, the mean payback from 
the purchase of three-month call options in the probability range of 0-5% was less than 
1.23 and 18.77 cents for every $1 invested in the options for the S&P 500 and FTSE 100, 
respectively. Deep in-the-money three-month calls and one-month calls on the S&P 500 
provide a mean return higher than the initial investment similar to the favorite-longshot 
bias in racetrack markets. 

For the put options on the S&P 500 and FTSE 100, we find evidence consistent 
with the hypothesis of Dumas et al. (1998) that investors pay more for puts than they 
are subsequently worth. The degree of overpaying for these options increases mono- 
tonically as the probability of finishing in the money decreases. These results are also 
consistent with Coval and Shumay (2001), for options in the similar strike price range 
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they considered in their research. We present an empirical study of these index option 
markets rather than a theoretical analysis of why the biases exist. We know that the 
put skew bias steepens when there is a large drop in the underlying index and flattens 
when prices rise and that the shape of the bias varies over time. The one-month versus 
three-month figures document this. The bias in the puts is related to portfolio insurance 
protection against downside risks. Harvey and Siddique (2000) call this skewness pref- 
erence. The deep out-of-the-money calls seem to be, as Figlewski (1989) argued, seen 
as lottery tickets which have low expected returns. The in-the-money and out-of-the- 
money three-month calls seem to reflect the equity risk premium as theoretically shown 
in Figure 2. The one-month calls in this range, which are highly price dependent on 
steep option time decay, are fairly priced for the S&P 500 and negatively priced for the 
FTSE 100. Investors’ aversion to downside risk shown in overpriced puts, especially fol- 
lowing the 1987 crash, is consistent with our data; see also Rubinstein (1994) and Bollen 
and Whaley (2002). This is similar to the pattern observed for the favorite-longshot bias, 
and is the expected cost of insurance. 

The month call options on the S&P 500 and the FTSE 100 have similar patterns, 
but with magnitudes closer to one. Only for the in-the-money calls on the S&P 500 is 
a favorite bias found. The deep in-the-money calls on the FTSE 100 pay an average 
return very close to the intial bet. For the out-of-the-money options, there is a reduction 
in the expected return (like a longshot bias). However, this is not as extreme as for the 
three-month options, and only statistically significant for the FTSE 100 options. 
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Abstract 


This chapter surveys the dosage breeding theory pioneered by Vuilliers, Varola, and 
Roman with major emphasis on two top classic three-year-old thoroughbred races, 
namely, the Kentucky Derby and the Belmont Stakes. Run at 1'4 mi and 1% mi respec- 
tively, they typically are at least 4% and ‘4 mi longer than any of the horses has ever 
raced before. This extra distance, combined with the large fields (especially in the 
Derby), make these two races a difficult test of stamina for horses this young. Bet- 
tors are also challenged because there is no direct evidence of whether a horse has 
the stamina to compete effectively at these distances. The informational content of the 
publicly available, pedigree-based measure of stamina, the Dosage Index, is used with 
simple performance measures to identify a semi-strong-form inefficiency. Statistically 
significant profits, net of transaction costs, could have been achieved during 1946-2006. 
This can be compared to the middle leg of the Triple Crown, the Preakness, run at '%., 
where the Dosage Index provided no advantage. 


JEL Classifications: G10, G14 


Keywords: semi-strong-form market efficiency, capital growth theory, speculative investments, 
sports betting 


1. INTRODUCTION 


The Kentucky Derby annually gathers many of the top three-year-old thoroughbred 
horses at Churchill Downs in Louisville, Kentucky on the first Saturday in May. For the 
horses entered, the race is a new challenge, since its distance of 1 mi is typically at least 
'4 mi longer than any of them has ever raced. The extra distance of the Kentucky Derby, 
usually combined with a large field that includes many top-flight contenders, presents a 
significant test of stamina for these young horses. Two weeks after the Kentucky Derby, 
many of the same horses plus others compete in the Preakness Stakes run at the shorter 
distance of '%,mi at Pimlico racetrack in Baltimore, Maryland. Then three weeks later, 
in early June, the 1⁄4 mi Belmont stakes is held at Belmont Park on Long Island, near 
New York City. And like the Kentucky Derby, typically none of the Belmont Stakes 
entrants has run this far this early in their careers. 

Since the horses in the Kentucky Derby and the Belmont are running at longer dis- 
tances than any earlier races, the public’s assessment of their stamina cannot be easily 
based on their past performances. Without direct evidence of a horse’s ability to run at 
the distances of these two races, bettors have looked to indirect evidence. One approach 
to assessing stamina that has received wide public attention looks at whether the sires in 
a horse’s pedigree have demonstrated a pattern of progeny with stamina. This approach, 
called dosage theory and described in Section 3, is coupled with evidence of success 
in major races as a two-year-old horse to study semi-strong-form efficiency of the 
Kentucky Derby by Bain, Hausch, and Ziemba (Bain et al., 2006; hereafter BHZ), and 
of the Belmont Stakes by Gramm and Ziemba (2007; hereafter GZ). 
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We describe the empirical analyses in BHZ and GZ, and summarize their results 
that dosage theory and success as a two-year-old horse point to semi-strong-form 
inefficiency. Two categories of horses are studied: dual qualifiers and asterisk qualifiers. 

Dual qualifiers are those with a speed/stamina balance consistent with the Derby 
and Belmont characteristics and who were strong enough as two-year-olds to be ranked 
within 101b of the top two-year-old. Asterisk qualifiers pass the dosage test but not 
the 101b test. However, they showed strength by winning a major race early in their 
three-year-old year. 

BHZ and GZ show that both dual qualifiers and asterisk qualifiers did in fact provide 
positive wagering profits in the Derby and even more so in the Belmont. Since the 
dosage theory is silent on the Preakness, these strong positive results do not apply there. 
The Kentucky Derby results are strongest for the pre-1996 period while the Belmont 
results are strong through 2006. There were no dual or asterisk qualifiers in the 2007 
Belmont. 

Roberts (1967) defined a market as being weak-form, semi-strong-form, or strong- 
form efficient if it is not possible to devise a profitable investment scheme net of 
transactions costs based on prices (or, for the racetrack, publicly available odds), based 
on all publicly available information, or based on all information, respectively. For tra- 
ditional financial markets, there is considerable evidence that points to weak-form and 
semi-strong-form efficiency, but little evidence for strong-form efficiency (see Fama, 
1970, 1991; and Keim and Ziemba, 2000, for surveys). 

Weak-form efficiency of the racetrack’s win market means that betting systems 
based solely on the public’s win odds, established through pari-mutuel betting, are 
not profitable. Evidence from many tracks over many years has pointed to weak-form 
efficiency, for example, Ali (1977) and Asch et al. (1982).! Weak-form efficiency of 
the win-betting market is a consequence of four of its features. First, transaction costs 
are high, about 13-20%, depending on track location, so a bettor needs to be con- 
siderably more successful than the average bettor just to break even.? Second, while 
the challenge is substantial, the concept of the win bet is relatively simple. Thus, 
bettors have no confusion about their task. Third, many racetrack bettors approach 
their wagering very seriously and some are very sophisticated. Fourth, for this serious 
audience, there is usually an abundance of relevant information, including records of 
past performances and workouts for all the horses, breeding, earnings, jockey records, 
and so on. 


"An exception may be extreme favorites at odds of 3/10 or less, which have been shown historically to pro- 
duce a small average profit; see Ziemba and Hausch (1986). However, data such as that shown in Ziemba 
(2008) and in Snowberg and Wolfers (2008) indicates that such positive profits currently do not exist with- 
out rebates. Inefficiencies in other more complex markets are more common; see Hausch et al. (1994) 
and other chapters in this volume for such evidence. Weak and semi-strong market efficiencies are dis- 
cussed in the chapters by Hausch and Ziemba (2008) and by Johnson and Sung (2008), respectively, in this 
volume. 

*Large bettors can reduce this take by betting at rebate sites that return a portion of the bet to make the actual 
take about 10%. We do not deal with such bettors here nor with those outside the U.S, who wager on Betfair 
or other betting exchanges against other bettors directly rather than in a pari-mutuel pool as discussed here. 
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For the Kentucky Derby and the Belmont Stakes, the first two of these criteria are 
satisfied for the win market. While the third criterion is met, the Kentucky Derby and 
Belmont Stakes also receive much more interest in North America from casual fans 
than other races. Because, typically, none of the Derby or Belmont Stakes entrants has 
raced at 1'4mi and 1'4 mi, respectively, it can be argued that the fourth criterion is not 
fully met. 

The objective in BHZ and GZ is to determine whether the informational content of 
the Dosage Index, a pedigree-based measure of stamina that is publicly available, in 
conjunction with simple performance measures, is captured in the pari-mutuel win odds 
and, if not, whether it can be used to develop a profitable betting scheme. 

The operation of the racetrack market is discussed in the first section. Section 3 
describes the Dosage Index and performance measures, and their application to the 
Kentucky Derby and Belmont Stakes. The data used by BHZ and GZ in their analy- 
ses are discussed in Section 4. Section 5 describes BHZ’s scheme for estimating each 
betting interest’s win probability based on the public’s odds, the Dosage Index, and the 
performance measures, with an application of this technique to the Kentucky Derby. 
The Kelly capital growth betting model is described in Section 6. The results from 
BHZ2’s joint application of their probability estimation scheme and Kelly wagering for 
the Kentucky Derby appear in Section 7. Sections 8 and 9 discuss GZ’s results for the 
Preakness and Belmont Stakes for 1946-2007. Conclusions appear in Section 10. 


2. THE RACETRACK AS A SEQUENCE OF MARKETS 


Prior to a race, bettors engage in markets that establish prices for the various betting 
opportunities for that race. Betting closes immediately before the race begins, and 
payouts are calculated immediately following the race. For win betting, there are N 
betting interests in a race. Let W; be the total amount bet to win on betting interest 
i=1,...,N. The win pool is 


w=), (1) 


The track payback, Q (generally 0.80 to 0.87 for win bets), is the fraction of each dollar 
bet that is returned to the bettors. The commission, or track take, is 1 — Q. If betting 
interest k wins the race, then win bets on betting interests i # k return zero, while each 
dollar bet on betting interest k returns approximately QW/W,. The actual profit per 
dollar is rounded down to the nearest nickel or dime (this is called breakage). Together 
the track take and breakage constitute the transaction costs.* 


JOur notation deals with one race only. In Section 5, to deal with several races simultancously, we add a 
superscript to our notation to identify the race number. 

4For cach track there is a minimum payout, usually 5%, that the track must return even if there are insufficient 
funds available in QW. If there is a rebate, then the track take is effectively reduced to about 10% so Q is 
about 0.90. 
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Typically, each horse in a race runs as a separate betting interest. However, two or 
more horses in a race that have common ownership typically run as a single betting 
interest known as an entry. In addition, in a race where there would be more betting 
interests than a preset maximum, the horses with the least-impressive credentials are 
grouped as a single betting interest known as the field. A bet to win on an entry or 
the field pays off if any member of that betting interest wins the race. Both have been 
common in the Kentucky Derby but not in the Belmont Stakes. However, the long-time 
restrictions in Kentucky changed in 2001, so there were no entries and no field in the 
Kentucky Derby from 2001 on. 


3. THE DOSAGE INDEX AND PERFORMANCE MEASURES 


The fact that usually no Derby (or Belmont) entrant has raced at 1'4 (1'4) mi prior to the 
race has led to the search for relevant information from alternative sources, including 
the horse’s pedigree. One method of evaluating a thoroughbred’s pedigree, commonly 
known as dosage theory, has its roots in the work of French cavalry officer Lt. Col. 
Jean-Joseph Vuillier, who studied the pedigrees of exceptional thoroughbreds of the late 
nineteenth and early twentieth centuries; see Vuillier (1902, 1906, 1928). The concept 
of thoroughbred dosage evolved through Varola, who developed a patented classifica- 
tion of prominent stallions according to the type of offspring that they produced in a 
series of articles in The British Racehorse; see also Varola (1974, 1980). 

Roman’s (1981) modifications of Varola’s work are known as dosage theory. His 
work was outlined in Leon Rasmussen’s Bloodlines column in the Daily Racing Form 
beginning before the 1981 Kentucky Derby. One product of Roman’s pedigree analysis 
is the Dosage Index (DI), which is based on the categorization of prominent stallions 
in terms of whether they consistently sire offspring with distance proficiencies that are 
incongruous with the dosage profiles of those offspring when that stallion is excluded. 
Classified stallions are called chefs-de-race (or simply chefs);> see Ziemba and Hausch 
(1987) and Roman’s Website (http://www.chef-de-race.com) for the rationale behind 
the selection of recent chefs, and Roman (2002). 

There are five categories for chef classification in Roman’s system: Brilliant, Inter- 
mediate, Classic, Solid, and Professional. The categorization is based on “where they 
[sires] must lie on the speed-stamina spectrum to bring the figures of their descendants 
back in line with those of horses in the general population exhibiting similar perfor- 
mance traits” (Roman, 2001). A chef can be placed in one or two categories. Each time 
a chef appears in a four-generation pedigree, points are awarded in the appropriate cat- 
egory. Points are assigned on a scale of 16 for the first-generation sire, eight for each 
second-generation sire, four for each third-generation sire, and two for each fourth- 
generation sire. Sires that are classified in two categories have their points split. After 


5Mares are not included because they are considered to have too few offspring to identify distance 
proficiencies, while it is not unusual for a stallion to sire 100-200 offspring in a year. 


528 


Calendar Anomalies and Arbitrage 


312 


Chapter 15 + Dosage Breeding Theory 


the 15 sires have been assigned points, the total for each category is entered into the 
Dosage Index formula 


_ Brilliant + Intermediate + 1/2 Classic 


Solid + Professional + 1/2 Classic (2) 

Horses with a high DI have a pedigree that is weighted towards Brilliant and Inter- 
mediate chefs, that is, sires who tend to produce offspring with greater sprinting ability 
than their pedigrees would suggest if that sire were eliminated from the pedigree. Horses 
with a low DI are predicted to have stamina. Very seldom will a stakes-quality horse 
have no dosage points, though some have so few that the D/ is unreliable. The pedi- 
gree and the dosage profile for 2005 Belmont Stakes winner Afleet Alex are shown in 
Table 1. Each pedigree shows the sire and mare for each horse for four generations. For 
example, Afleet Alex’s sire and mare were Afleet and Nurvette, with their respective 
sires and dams shown directly to their right in the pedigree. 

After the initial classification of chefs in 1981, Roman found that no Kentucky Derby 
winner from 1940 to 1980 had a DI exceeding 4.0, despite about one in seven entrants 
having a DJ that high. 

The Dosage Index is not a direct measure of the quality of a horse. One 
quality measure is the experimental free handicap (EFH), an annual ranking of 
two-year-old thoroughbreds that raced in select races in the United States. (see 


TABLE1 Pedigree and Dosage Index Calculation for 2005 
Belmont Stakes Winner Afleet Alex 


Mr. Prospector Raise a Native (B) 
(B/C) Gold Digger 
Venetian Jester 
Polite Lady 
Friendly Ways 
Northern Afleet 
Northern Dancer (B/C) 
Nureyev (C) 
Special 
Nuryette = 
Tentam 
Stellarette 
Square Angel 
[ Roberto (C) 
Silver Hawk 
Gris Vitesse 
Hawkster 
Chieftain 
Strait Lane 
M Hawk Level Sands 
a: aw 
e5 Utrillo II 
Hawaii 
Ethane 
Qualique ko 
Sensitivo 
Dorothy Gaylord 
Gaylord’s Touch 
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Generation Sire Brilliant Intermediate Classic Solid Professional 

1 Northern Afleet 
2 Afleet 

Hawkster 
3 Mr. Prospector 2 

Nureyev 4 

Silver Hawk 
Hawaii 

4 Raise a Native 2 


Venetian Jester 

Northern Dancer 1 i] 
Tentam 
Roberto 2 
Chieftain 
Utrillo I 
Sensitivo 

Total 5 0 9 0 0 


NOTE: Dosage Index = (5 + 0 + 9/2)/(0 +0 + 9/2) = 2.11. 


http://www.jockyclub.com/experimental.asp). Conducted since 1933 by the Jockey 
Club, the EFH assigns the top runners a figurative weight on a scale that usually has 
the two-year-old champion weighted at 126 lb.° Exceptional horses have been weighted 
up to 1301b. Other top horses are assigned lower weights based on perceived ability 
until a cutoff is reached at about 1001b beyond which no more horses are classified. 
Usually there are 15 to 30 horses classified within 101b of the top-weighted horse. 
Roman (1981) observed that starting in 1972, most Kentucky Derby winners were rated 
within 10 1b of the top-weighted horse. This observation led to the designation “dual 
qualifier” for any horse that was weighted within 10 lb of the top-weighted horse on the 
EFH (indicating the quality of the horse) and had a DJ less than or equal to 4.0.’ 

Professional handicapper James Quinn offered a second measure of quality to add 
late-developers to the list. He defined what we call an “asterisk qualifier” to be any 
horse that: (1) won at least one of a selection of premier races prior to the Kentucky 
Derby or Belmont Stakes; (2) had a D7 less than or equal to 4.0; and (3) was not rated 
within 10 Ib on the EFH. A horse is a “dual-or-asterisk qualifier” if it qualifies for one 
of these two categories. 


Ranking horses by weight is a familiar concept at the racetrack. In handicap races, the top horses carry greater 
weight (jockey + saddle + additional weights if necessary) than the less-qualified horses. Handicapping of 
this sort occurs only in select races and is intended to make the race more competitive. 

7Some people expand the dual qualifier category to include any horse that is declared a champion in a country 
other than the U.S. and has a DZ less than or equal to 4.0. In this chapter, only the first definition was used. 
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The objective of BHZ and GZ was to study whether these widely publicized mea- 
sures have any predictive power that is not incorporated into the public’s pari-mutuel 
win odds. If not fully incorporated, then a further objective was to investigate whether 
these measures could be used to determine win probability estimates that are sufficiently 
superior to the public’s so that a profitable wagering scheme based on win betting could 
be developed, despite the significant transaction costs. 


4. DATA ACQUISITION 


This section discusses the nature of the data in BHZ and GZ, while the sources of their 
data are described in the Appendix. 

The public’s win betting pool and results were collected for the period 1946 to 2007. 
For 54 of these years, dollar amounts that the public wagered were found which yielded 
precise values for q;. For the other eight years, only the final win odds for each betting 
interest were available. In these cases it was possible to back out win probabilities that 
were consistent with these odds. 

The EFH listing and pedigree information for each Derby participant were collected 
for each year from 1946 to 2007. The original list of chefs was published in 1981. For 
years prior to 1981, this list was used, which means that the classification of chefs for 
1946 to 1980 is not completely out of sample. The Kentucky Derby hypothetical betting 
begins in 1981, so all betting is based on lists of chefs that were out of sample. For the 
period 1981-1986, the 1981 list was used (see Appendix for explanation). After 1986, 
an updated list of chefs was used each year. The Belmont calculations are for 1981-2006 
and 1946-2006, with no dual or asterisk qualifiers in 2007. 

The major races for asterisk-qualifier status, with their 2007 graded stakes clas- 
sification and the years that they have been run over the interval 1946-2005, were 
the Blue Grass Stakes (G1; 1946-2005), the Flamingo Stakes (currently not run; 
1946-1989, 1992-2001), the Florida Derby (G1; 1952-2005), the Santa Anita Derby 
(G1; 1946-2005), and the Wood Memorial Stakes (G1; 1946-2005). The Flamingo 
Stakes declined in importance before being cancelled, but was included because 
historically it was an important prep race. 


5. APPLICATION OF BREEDING INFORMATION AND 
PERFORMANCE MEASURES TO REFINE ESTIMATED 
WIN PROBABILITIES FOR THE KENTUCKY DERBY 


BHZ developed two models for estimating win probabilities that depended on whether 
a betting interest was a dual qualifier or a dual-or-asterisk qualifier. 

The 1995 Kentucky Derby is used in Table 2 to illustrate the required information. 
Also evident is a complication with regard to accounting for pedigree with entries (and 
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TABLE 2 Sample Input Data: 1995 Kentucky Derby Field 


Qualifier status 


Horse Wi/W Entry EFH Won specified DI Dual Asterisk 
Jambalaya Jazz 0.044 1 115 1.15 

Pyramid Peak — Flamingo 3.00 . 
Serena’s Song 0.189 2 122 2.11 e 

Timber Country 126 3.29 e 

Mecke 107 4.50 

Knockadoon — 3.57 

Citadeed 0.066 Field — 1.60 

In Character — 1.77 

Ski Captain — 3.67 

Lake George — 4,50 

Thunder Gulch 0.033 116 Florida Derby 4.00 e 

Tejano Run 0.087 121 2.38 e 

Jumron 0.126 115 3.80 

Eltish 0.070 123 3.00 ° 
Afternoon Deelites 0.086 124 5.00 

Suave Prospect 0.059 113 4.60 

Talkin Man 0.167 114 Wood Memorial 3.00 ° 
Dazzling Falls 0.029 111 6.20 

Wild Syn 0.042 — Blue Grass 4.33 


NOTE: W;/W: Post-time fraction of win pool. Entry: Entry number or field. EFH: Experimental 
free handicap weight. Blank implies not weighted. (High weight for two-year-olds from 1994 was 
126 tb.) Won specified race: e Winner of a major race prior to Kentucky Derby. D1: Dosage Index: see 
Equation (2). Qualifier status: e Implies meets qualifier requirements. Source: Bain et al., 2006. 


the field): the horses in an entry may not have the same qualifier status. This difficulty 
was handled using the following scheme: 


l. 


2. 


If all members of an entry had the same qualifier status, then the entry was 
considered as one horse with that qualification. 

If one member of the entry was a dual qualifier plus had won any of the desig- 
nated major races prior to the Kentucky Derby, the entry was considered to be a 
dual qualifier regardless of the qualifications of the other member(s) (based on 
the presumption that in most cases most of the public’s attention on the entry was 
due to that horse). 


. If the members of an entry did not all have the same qualifier status, but each 


was either a dual qualifier or an asterisk qualifier, then the entry was viewed as a 
dual-or-asterisk qualifier. 


. Inall other cases the entry was considered to be neutral, that is, neither an asterisk 


qualifier nor a dual-or-asterisk qualifier. 
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The qualifier status of the field was determined in the same manner. For the dual- 
qualifier model there are 67, 0, 10, and 22 betting interests in the respective categories, 
and for the dual-or-asterisk-qualifier model there are 57, 2, 10 and 30 betting interests 
in the respective categories. 

With respect to the dual-qualifier model, of the winners there are 29 that are quali- 
fiers, 26 that are not qualifiers, and 3 that are part of a neutral entry. With respect to the 
dual-or-asterisk-qualifier model, of the winners there are 41 qualifiers, 16 that are not 
qualifiers, and 3 that are part of a neutral entry. In 1998 and 2003 there were no dual 
qualifiers, so those years were ignored in the dual-qualifier modeling. 

The base-case model relates a betting interest’s win probability to the public’s wager- 
ing to see if looking solely at the pools without the “expert information” could lead to a 
profitable betting scheme. 

Let W;’ be the public’s win bet on betting interest i, W/ be the win pool in race j, and 
N/ be the number of betting interests in race j. For race j, define p to be the probability 
that betting interest i wins and define q} = W7 /W/ to be the fraction of the win pool 
bet on betting interest i. For this base case, the following model was used for each race 


j CO 


Pi = o R (3) 
2 (am) 


If ò = 1, then p; = q}. 

BHZ used a standard maximum-likelihood approach to estimate optimal values of 
ò. Consider R independent races and define K = (k1,..., kpg) to be an R-tuple repre- 
senting the winners of the R races, that is, k; is the number of the betting interest that 
won race j. Let Pr, represent the estimated probability based on Equation (3), evalu- 
ated before race j, that betting interest k; wins race j. The probability that the vector 
K corresponds to the winners of the R races is 


R 
PK) =|] Pez (4) 


j=l 


Treating Equation (4) as a likelihood function that depends on 8 gives 


R 
elk [fo (5) 


j=l 


A maximum-likelihood point estimate for 5, namely dy, can be found by maximizing 
the likelihood as a function of 6. The first value for m, was calculated using the first 10 
years of data, namely 1946-1955 inclusive. Thereafter, the value of m, was updated 
for each year using data from 1946 to that year. The win pool fraction for the winner 
and values for ôm, calculated after each year’s race are shown in Figure 1. 
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FIGURE 1 Value for òp after each year’s race and the fraction of the win pool bet on the winner, 
1955-2005. Source: Bain et al., 2006. 


The values for òm are less than 1.0 for the years prior to 1974. This is a conse- 
quence of the public’s more favored betting interests winning less often during this 
period than would have been expected based on the public’s odds. The winners from 
1972 to 1979 were dominated by favorites, culminating with Spectacular Bid in 1979, so 
dmx increases over this interval, reaching a maximum value of 1.12. During the period 
of 1980-2005, the public’s favorite seldom won, so mı tends to decrease to it final 
value 0.92.8 

With values of òm, close to 1.0, Equation (3) generates revised win probabilities that 
differ only slightly from the fraction of the win pool. The greatest ratio p;/q; over the 
interval 1981-2005 is 1.12. This 12% edge is insufficient to offer a positive expected 


8Griffith (1949), McGlothlin (1956), Ali (1977), Asch et al. (1982), and Ziemba and Hausch (1986), among 
others, have demonstrated that the public’s wagering has a strong and stable bias of underbetting the favorites 
and overbetting the longshots. This results in è > 1.0. Ziemba and Hausch (1987) provided evidence that 
this “favorite-longshot bias” is exhibited at the Kentucky Derby but it is weaker, that is more flat, than in 
these earlier studies. The recent advent of rebate and betting exchange wagering has led to a flattening of the 
favorite-longshot bias in recent data since about 1998; see Ziemba (2004, 2008) and Snowberg and Wolfers 
(2008). 
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return on a win bet after accounting for the transaction costs; hence this simple model 
points to weak-form efficiency of the win-betting market over this period. 

The main objective of BHZ was to use this procedure to create models that modified 
the win probability for each betting interest based on whether or not it was considered 
a dual qualifier or considered a dual-or-asterisk qualifier. 

For this case, BHZ viewed the probability of betting interest i winning to be 


. CAN 
Pi = ———., Yin Ym =, B, or 1. (6) 


No 
x (Gn) 


The variable y; = a if betting interest i was a dual qualifier (or dual-or-asterisk qualifier 
depending on the test), y; = 8 if it was classified as not a dual qualifier (or not a dual- 
or-asterisk qualifier if applicable), and y; = 1 if the betting interest was an entry or field 
classified as being neutral. 

Based on Equation (6), BHZ calculated annual maximum-likelihood values for œ 
and B, denoted as am, and Buz, were calculated each year. The initial estimate for 
1956 used the first 10 years of data (1946-1955). Figure 2 illustrates the progression 
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FIGURE 2 Values for æm, and Bmx for dual-qualifier model after each year’s race, 1955-2005. 
NOTE: In the figure, Qua = dual qualifier; Neu = member of a neutral entry; and Non = non-qualifier. 
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FIGURE 3 Value for amu and Bz for dual-or-asterisk qualifier model after each year’s race, 1955-2005. 
NOTE: In the figure, Qua = dual qualifier; Neu = member of a neutral entry; and Non = non-qualifier. 
Source: Bain et al., 2006. 


of am and Bur values for the dual-qualifier model, and Figure 3 shows amy and BML 
values for the dual-or-asterisk-qualifier model. 

In Figures 2 and 3, the critical pattern is the relative magnitude of amı and Ba. 
In Figure 2, for nearly 20 years, am, exceeds By. Consequently, for this period, the 
revised win probability for each dual qualifier is less than the fraction of the money bet 
on it in the win pool. This implies that betting on dual qualifiers, if they had been known, 
would not have been advantageous during that period. In the mid-1970s, dual qualifiers 
began to win consistently, eventually leading to Bu exceeding amr, for the remainder 
of the study period. For this later period, the revised win probabilities for dual qualifiers 
exceed their fraction of the win pool. Figure 3 for dual-or-asterisk qualifiers shows a 
similar pattern, although By begins to exceed amy after only three years. Thus, after 
the third year, the model predicts win probabilities for dual-or-asterisk qualifiers that 
exceed their fraction of the win pool. 

The original and revised estimates of win probabilities for 1995 are in Table 3, where 
a betting interest’s estimated win probability rises if it meets the qualifier criterion. 
This is not necessarily the case if there are many qualifiers because the sum of the 
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TABLE 3 Original and Revised Estimated Win Probabilities 
for the 1995 Kentucky Derby 


Betting interest W/W DQ PDQ DAQ  ppag 
Entry 1 0.044 -1 0.029 0 0.027 
Entry 2 0.189 1 0.242 1 0.220 
Field 0.066 ~1 0.044 -1 0.035 
Thunder Guich 0.033 1 0.075 1 0.076 
Tejano Run 0.087 1 0.144 1 0.137 
Jumron 0.126 -1 0.085 -1 0.068 
Eltish 0.070 1 0.125 1 0.121 
Afternoon Deelites 0.086 -1 0.057 -1 0.046 
Suave Prospect 0.059 -1 0.039 -1 0.031 
Talkin Man 0.167 ~1 0114 1 0.204 
Dazzling Falls 0.029 -1 0.019 -l1 0.015 
Wild Syn 0.042 -1 0.027 -l1 0.022 


NOTE: W;/W: fraction of win pool bet on the betting interest. DQ: 
Indicator is 1 for dual qualifiers, 0 for unclassified entries, and —1 for 
non-qualifiers. ppg: Revised estimated win probability based on dual- 
qualifier model. DAQ: Indicator is 1 for dual-or-asterisk qualifiers, 0 for 
unclassified entries, and —1 for non-qualifiers. ppag: Revised estimated 
win probability based on dual-or-asterisk qualifier model. Source: Bain 
et al., 2006. 


probabilities is unity. The effect of considering asterisk qualifiers is demonstrated by 
Talkin Man, an asterisk qualifier but not a dual qualifier. Talkin Man’s estimated win 
probability varies from 0.114 to 0.204 depending on the criterion used. 

The revised win probability estimates are occasionally sufficiently greater than the 
fraction of the win pool to allow a positive expected return even considering transaction 
costs. 

The percent increase in the win probability over the fraction of the win pool for 
Thunder Gulch is much greater than for Entry 2. There is a general tendency for p;/q; 
to increase for qualifiers as q; decreases, which is a consequence of the power function 
model in Equation (6) together with values of œ < B anda < 1. 


6. THE KELLY BETTING MODEL 


BHZ calculated betting amounts using the Kelly-optimal capital growth model which 
maximizes the expected logarithm of wealth on a race-by-race basis. This approach 
was proposed by Kelly (1956), and was extended and rigorously proved by Breiman 
(1961) and Algoet and Cover (1988). Among its properties are: (1) it maximizes the 
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asymptotic growth rate of wealth; (2) it asymptotically minimizes the expected time to 
reach any specific sufficiently large wealth level; and (3) in the long run, it outperforms 
any other essentially different betting strategy almost surely and asymptotically provides 
infinitely more final wealth than any other essentially different strategy. (See MacLean 
et al., 1992, 2006; and Thorp, 2006, for further properties, and Ziemba and Hausch, 
1986, for simulation results for shorter time horizons.) 

The revised probability of betting interest i winning based on dual-qualifier or dual- 
or-asterisk qualifier status is p;. Let r; be the gross return per dollar bet based on the 
win odds established by the public. (As in Section 1, the superscript indicating the race 
number is suppressed.) The model requires solving the following optimization problem 
for each race: 


N N N 
maximize F nos(1-F tsr) st fi20Yi=1,...,N and Ñ fiS 1. 


m=1 i=l 


(7) 


The decision variable, f;, is the fraction of the current wealth to bet on betting interest 
i. Suppose that the bettor’s initial wealth is w and betting interest i wins. Then w f;r; is 
returned to the bettor after having invested 


N 
wÈ, Sus 
m=) 
for a final wealth of 


N 
v(t Dae sin), 


m=} 


The objective function is the logarithm of final wealth for each betting interest winning, 
weighted by the probability of that betting interest winning. Initial wealth, w, can be 
disregarded in the formulation with the non-negative decision variables the fraction of 
wealth that is bet on each betting interest. The constraints comprise a budget constraint. 

This formulation assumes that the bets are sufficiently small so that they do not influ- 
ence the payout on any betting interest, that is, the bets on betting interest i do not reduce 
ri. For the Kentucky Derby and the Belmont Stakes, the win betting pools are so large 
that a typical bet is unlikely to influence the payouts.° The large pools also permit this 
assumption because the percent bet on each betting interest is assumed to vary little in 
the final few minutes. 


°See Hausch et al. (1981) for a formulation that does account for the bettor’s effect on payouts. 
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7. THE KENTUCKY DERBY, 1981-2006 


In BHZ’s study of semi-strong efficiency of the Kentucky Derby win market, they 
started by revising win probabilities using the base-case model [Equations (3-5)]. 
The revisions were sufficiently close to the public’s win probabilities that expected 
returns were negative for all betting interests during the period 1981-2005. Thus, 
the optimization problem in Equation (7) with these revised probabilities led to no 
wagers. 

For the models based on Equation (6) and on status as a dual or dual-or-asterisk 
qualifier, BHZ started with an initial wealth of $2,500 in 1981. Wealth was updated 
each year based on the bets made and the actual outcome of the race. The wealth history 
for betting from 1981 to 2005 is shown in Figure 4 for dual qualifiers and for dual-or- 
asterisk qualifiers. The overall results are summarized in Table 4. For the years up to 
the mid-1980s, any advantage identified by the model is small and results in small bets. 
Comparing the values in Figure 4 to those in Figures 2 and 3 shows that ayy and By. 
are relatively close over that interval. As the model predicts a greater advantage, the 
amount per bet, and consequently the volatility, grows. 
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FIGURE 4 Wealth level history for Kelly win bets, 1981-2005. Source: Bain et al., 2006. 
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TABLE 4 Profits from Both Models Based on Kelly Bets 
and Revised Win Probabilities 


Model based on qualifier type 


Dual Dual-or-asterisk 
Number of bets 61 107 
Total amount bet $32,828 $66,467 
Number of bet that won 9 14 
Initial wealth $2,500 $2,500 
Final wealth $5,514 $4,889 
Total profit $3,014 $2,389 
Percent return on investment 9.2 3.6 


Source: Bain et al., 2006. 


For comparison, betting $2,500 on the favorite to win from 1981 to 2005 would 
yield a loss of $41,500, betting $200 to win on each dual qualifier would yield a profit 
of $12,920 on $13,200 bet, and a $200 bet to win on each dual-or-asterisk qualifier 
would yield a profit of $7,780 on $23,000 bet (neutral entries excluded as qualifiers). 
The improved return on investment compared to the OCGM in the short run is mostly 
due to a few huge profits on qualifiers Gato del Sol in 1982, Ferdinand in 1986, and 
Thunder Gulch in 1995. 

For both models the betting scheme produced profits during the 1980s and up to 
the mid-1990s, but the performance has been poor since. Several possibilities can be 
considered for this: 


1. The sample size is small. This could mean that the sequence of successes for 
qualifiers for both models from 1972 to 1997 was a short-term run, so that in the 
long run there is nothing to be gained from using either model developed here. 
The limited sample size also implies that the final wealth is sensitive to individual 
race results. To give an idea of the scope, two extreme examples are (i) if a non- 
dual-qualifier had won in 1995, instead of Thunder Gulch, the final wealth for the 
dual-qualifier model would be $2,221, that is, a slight loss overall, and (ii) if 2005 
winner Giacomo had a favorable change of a single dosage point in any category, 
Giacomo would have been a dual qualifier and the final wealth would have been 
$13,877. 

2. It is extremely difficult to make a proper assessment of all of the two-year-olds, 
so the EFH can omit suitable horses. A pointed example occurred in 2003 where 
eventual 2004 Derby winner Smarty Jones was not rated on the EFH. Yet, he 
had overwhelmed a field of state-bred two-year-olds at Philadelphia Park, but 
that race is not counted when determining the EFH. Roman (2005a) lists other 
Derby winners such as Winning Colors and Sunday Silence who were superior 
two-year-olds but were not rated on the EFH. 


540 


Calendar Anomalies and Arbitrage 


324 


Chapter 15 + Dosage Breeding Theory 


3. Classification of chefs is an ongoing exercise. For example, Alydar was classified 
as a chef subsequent to Strike The Gold winning the Derby in 1991, so Strike The 
Gold, who won the Blue Grass Stakes, is not considered as an asterisk qualifier 
here (DJ = 9.00), yet when Alydar was classified, Strike The Gold’s DI was 
reduced to 2.60. Ziemba (1991) wrote a column about this on April 28, 1991, 
one week prior to the 1991 Derby arguing that Alydar should be a classic chef 
as Alydar had numerous classic distance winners. The reclassification of Alydar, 
and at what point, would possibly change other pedigrees. However, BHZ and 
GZ use Roman’s classification, so Strike the Gold is neither a dual nor an asterisk 
qualifier. 

4. Roman (2005b) has pointed out the gradual rise in the DI of Derby winners over 
time, so the failure of the system in the last few years of the study may reflect 
a shift of the overall breed in North America toward speed at shorter distances. 
Real Quiet in 1998, Charismatic in 1999, and Giacomo in 2005 all had DJ values 
greater than 4.00. 

5. The Flamingo Stakes decreased in significance in the final years that it was run. 
Including the Flamingo winner as an asterisk qualifier in recent years was unwar- 
ranted in retrospect. Two possible solutions are to drop the Flamingo at some 
point in the analysis, or switch to the Arkansas Derby as the fifth significant 
prep race. 


Random betting generates expected losses in excess of 16%, due to the 16% track 
take plus breakage. BHZ showed both qualifier designations approximately doubling 
wealth over the betting period. They also used two approaches to address the statistical 
significance of these profits. The first approach treats a betting interest’s win or loss as a 
binomial random variable and then uses a normal approximation. The second simulates 
the set of races assuming random wagering. 

Before considering their first approach, observe that the data in Figure 4 are not 
ideal for addressing statistical significance. Wealth generally grows until the mid- 
to-late 1990s, and then dramatically falls. This pattern of wins and losses leads to 
wealth that is highly variable. Focusing on just the dual-qualifier case, Figure 5 super- 
imposes on Figure 4 the wealth level history assuming that the races were run in 
reverse order, that is, we started in 2005 with $2,500, then updated wealth based 
on our results in 2005 and went to the 2004 race, and so on. Thus, the string of 
large losses occurs early with lower wealth, after which wins are more common. 
The final wealth is identical, since the optimal capital growth system simply deter- 
mines the optimal fraction of wealth to bet each year. (For example, losing 10% one 
year and gaining 20% the other year leads to an overall return of 8%, whatever the 
order of the win and the loss.) Despite the final wealth being the same, the wealth 
histories are very different, as is the appearance of any statistical significance to the 
profits. 

To eliminate the effect of varying wealth, which can dampen or intensify the vari- 
ance in profits, BHZ’s test of statistical significance uses bets and returns in each race 
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FIGURE 5 Betting wealth for Kelly win bets on dual qualifiers with races run forward (1981-2005) and 
run backward (2005-1981). Source: Bain et al., 2006. 


assuming an identical initial wealth each race. They do not update wealth year by year 
(as in Figure 4); instead the initial wealth each year is assumed to be $2,500. 

Let q be the probability of winning a bet in each trial, n be the number of trials, 
c be the amount wagered each trial, and r be the gross return upon winning,'® and let 
X be the random variable representing the number of wins. The probability of profits 
exceeding a constant 7 is 


P[rX -ne >]. (8) 


Assume that the trials are independent. For bets in different races, this assumption is 
reasonable. For multiple bets on the same race—which are common—wins are nega- 
tively correlated, since if one betting interest wins then the others must lose. Negative 
correlation leads to a tighter distribution of wins, so this analysis based on independent 


101m practice, q, c, and r vary across races and even within races if there are multiple wagers. We approximate 
the sequence of wagers by using the average values of these parameters. 
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trials underestimates the statistical significance of the results. Since X is binomially 
distributed, the normal distribution approximates Equation (8) as 


Rene _ ng 
1-o| —— |, (9) 
| =| 


where ® is the cumulative distribution function of a standard N (0, 1) variable. 

For dual qualifiers and assuming an initial wealth of $2,500 each year, there were 
61 bets totaling $7,079, and of these bets, nine won for a gross return of $11,357 and a 
profit of $4,278. Thus, c = 7,079/61 = 116.0 and r = 11,357/9 = 1,262. If the system 
were no better than random betting, then q satisfies rg — c = —0.16c, recognizing the 
16% track take, giving g = 0.07725. Then, by Equation (9), the probability of profits of 
at least the observed level of $4,278 given random betting is 2.0%. Suppose instead that 
the system is better than random betting but only good enough to offer zero expected 
profits. Then q solves rq — c = 0, or q = 0.09192, and, by Equation (9), the probability 
that such a system would produce at least the observed profits is 6.7%. 

For dual-and-asterisk qualifiers, with initial wealth of $2,500 each year, there were 
107 bets totaling $13,268; 14 of these bets won for a gross return of $18,253 and a 
profit of $4,985. Thus, c = 124 and r = 1,304. If the system were no better than random 
betting, then g = 0.07988 and the probability of profits of at least the observed level 
is only 2.6%. Assuming instead that the system is better than random betting but only 
good enough to offer zero expected profits, then q = 0.09509 and the probability that 
such a system would produce at least the observed profits is 10.4%. 

The second approach used by BHZ to address the statistical significance of the results 
involved two simulations for each qualifier designation. The first simulation dealt with 
the question of how likely it would be that profits at the observed level would have 
been generated if their approach was vacuous and, therefore, was essentially random 
wagering. The second simulation asked how likely it would be that the observed profits 
would have been generated if the system was able to improve upon random wagering, 
but only enough to achieve zero expected return on each wager (excluding breakage). 
The algorithm for the first simulation was 


1. Start with a betting wealth of $2,500 in 1981. 

2. Determine the fraction of wealth to wager on each betting interest i for the current 
year based on the Kelly criterion and the (wrong) assumption that our probability 
estimate, p;, is correct. 

3. Randomly select the winner, with the probability of winning for betting interest 
i being qj. 

4. Based on the simulated winner, its payout and our wagers, update wealth. 

5. Repeat steps (2) to (4) for each year in order up to 2005. 


The second simulation differed only in step 3, where the simulation used q;/Q as 
the correct win probability for any betting interest i that received a wager in step 2. The 
expected return on every wager was zero, before accounting for breakage. The collective 
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TABLE 5 Results from 10,000 Betting Simulations with $2,500 Initial Wealth 


Dual qualifier Dual-or-asterisk qualifier 
Simulation 1 Simulation? Simulation! Simulation 2 
Final wealth < $1,000 (%) 54.3 37.0 73.8 49.3 
Final wealth < $2,500 (%) 84.2 72.6 91.5 76.1 
Final wealth > system’s final 3.9 9.2 3l 11.4 
wealth (Table 4) (%) 

Mean final wealth $1,555 $2,449 $1,030 $2,473 
Median final wealth $892 $1,371 $450 $1,022 
Maximum final wealth $67,601 $161,777 $97,336 $409,294 


Source: Bain et al., 2006. 


win probability of the other betting interests was such that probabilities summed to 
one. For example, in a three-horse race with [g), q2, q3] = [0.42, 0.21, 0.37] and Kelly 
bets having been placed on the first two horses based on [ pi, p2, p3], the probability 
that the simulation would select each betting interest as the winner was [0.42, 0.21, 
0.37] for Simulation 1, and was [0.5, 0.25, 0.25] for Simulation 2 after dividing the first 
two fractions by 0.84. The simulations were run 10,000 times each. The results are in 
Table 5. 

For all simulations, losses occurred more than 70% of the time. For Simulation 2 
and for both types of qualifiers, the mean final wealth was close to $2,500, which is 
expected given the modification in step 3 of this simulation. For dual qualifiers, only 
3.9% of the time did Simulation 1 realize profits as high as our observed profit. For 
Simulation 2, the corresponding value is 9.2%. For dual-or-asterisk qualifiers, these 
values for Simulations 1 and 2 are, respectively, 3.1% and 11.4%. 

As a final test, the analyses were conducted using limited data. For example, if the 
year being considered was 1997 and the interval was 25 years, only information from 
1972 to 1996 would have been applied. For the dual-qualifer model, using the entire 
dataset produced the greatest final wealth; while for the dual-or-asterisk model, using 
an interval of 56 years produced a final wealth of $5,305. 


8. THE PREAKNESS STAKES, 1946-2006 


The Preakness Stakes, unlike the Kentucky Derby and Belmont Stakes, does not provide 
a new test of stamina for its competitors. The race is run two weeks after the Kentucky 
Derby and is 1/16 mi shorter in distance. The application of the dosage system would 
likely be negated by the fact that entrants in the Preakness exiting the Kentucky Derby 
have a quantifiable result of their ability to run longer distances. GZ find that dual quali- 
fiers and asterisk qualifiers outperform non-dual qualifiers and non-asterisk qualifiers 
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but have not historically earned a positive wagering return. Using a test for differences 
between two population proportions given in Equation (10), we are able to show that 
dual qualifiers win a statistically greater proportion of races than non-dual qualifiers 
and asterisk qualifiers win a statistically greater proportion of races than non-asterisk 
qualifiers. The test statistic is 


bee (Dag a Podq ) (10) 


where p is a proportion of winners from dual qualifier (dq), non-dual qualifiers (ndq), 
and the pooled sample and Maq is the number of dual qualifiers and naq non-dual 
qualifiers. The 116 dual qualifiers won 28 of 61 Preaknesses between 1946 and 2006. 
This translates into 24.1% (218/116) of dual qualifiers were winners versus only 7.5% 
(33/440) of non-dual qualifiers which is significantly different at a 1% level (z = 5.10). 
The results were similar for asterisk qualifiers (z = 5.36) with 22.0% winners (36/164) 
and non-asterisk qualifiers with 6.4% winners (25/392). 

While dual and asterisk qualifiers won the Preakness more than non-qualifiers, the 
wagering returns were negative for flat win bets on dual and asterisk qualifiers from 
1946 to 2006 (see Table B2 in the Appendix and Figure 6). Wagers on each of the 
116 dual qualifiers would yield a loss of $14.90 (—12.8%). The interval from 1981 to 
2006 which corresponds to the first Roman/Rasmussen publication on dosage in the 
Daily Racing Form, however, does show a net positive return of $2.80 (7.6%) for dual 
qualifiers (see Table B2 in the Appendix and Figure 7). Win wagers on asterisk qualifiers 
lose $39.80 (24.3%) from 1946 to 2006 and $17.10 (—10.4%) from 1981 to 2006. 


Preakness Stakes 
Wealth Level History for $1 Win Bets 
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FIGURE 6 Preakness wealth level history for $1 win bets, 1946-2006. 
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FIGURE 7 Preakness wealth level history for $1 win bets, 1981-2006. 
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FIGURE 8 Belmont Stakes wealth level history for $1 win bets, 1946-2006. 


9. THE BELMONT STAKES, 1946-2006 


The Belmont Stakes provides a unique test of stamina where competitors run a full 
1/4 mi farther than they have before. GZ find that the application of expert information 
from the dosage theory does result in a significant advantage. Table B3 and Figures 8 
and 9 give the wealth level histories for 1946-2006 and 1981-2006, respectively. Both 
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Belmont Stakes 
Wealth Level History for $1 Win Bets 
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FIGURE 9 Belmont Stakes wealth level history for $1 win bets. 1981-2006. 


demonstrate considerable statistically significant positive profits. Year by year dual and 
asterisk results for 1946-2006 have $1 flat win bets beginning in 1946 growing to result- 
ing in $105.55 (98.6%) and $164.85 (113.7%) in profit for dual and asterisk qualifiers, 
respectively. The results for 1981-2006 yield $80.85 (216.3%) and $67.95 (129.7%) in 
profit for dual and asterisk qualifiers, respectively. 

Using differences between two population proportions given earlier in Equation 
(10), gives that the proportion of dual qualifier winners is statistically greater than 
the proportion of non-dual qualifying winners at less than the 1% significance level 
(z = 6.73). Indeed, the dual qualifiers won 29.9% of the time vs. 6.8% for the non-dual 
qualifiers. We reject the hypothesis at Paq = Pydq at a level well below 1% signif- 
icance. The proportion of asterisk qualifier winners is also statistically greater than 
the proportion of non-asterisk qualifier winners at less than the 1% significance level 
(z = 6.26). The asterisk qualifiers won 25.5% of the time versus 6.2% for the 
non-asterisk qualifiers. 


10. CONCLUSIONS 


The racetrack is a useful financial market for testing market efficiency and considerable 
evidence exists in support of the track’s win market being weak-form efficient. This 
chapter, however, summarizes the work of BHZ and GZ that show the win market is not 
semi-strong efficient. 

BHZ and GZ focused on a particular aspect of the Kentucky Derby and Belmont 
Stakes, whose distances of 1⁄4 and 1'4 miles are typically farther than any entrant 
has ever raced, and the Preakness Stakes, which is run between these two races at 
ymi. This lack of direct evidence of an entrant’s stamina for this race has motivated 
the search for indirect evidence. Dosage theory, which analyzes a horse’s pedigree, has 
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been offered as such evidence but it has also been controversial, both in general and 
in its relation to the Kentucky Derby. Other evidence that has been offered includes 
well-publicized rankings of horses and results from recent high-caliber races. 

BHZ did not evaluate the criticisms or the justifications offered for the dosage con- 
cept and for the ranking of two-year-olds, nor did they attempt to refine their application 
to the Kentucky Derby. Instead, they simply merged this publicly available information 
with the public’s win odds to establish “adjusted” win probabilities. They then tested 
these win probabilities within a betting system based on the optimal capital growth 
model and showed statistically significant profits. 

GZ applied this procedure to the 14mi Belmont Stakes, which is run five weeks 
after the Kentucky Derby. From the 1980s to the mid-1990s when the dual qualifiers 
were having very good success in the Derby, GZ’s results in the Belmont were not as 
good. However, in recent years the situation has reversed with superior results in the 
Belmont than the Derby. The betting systems discussed here are two of many strategies 
used by bettors. In the '%. Preakness, the dosage breeding theory is less of a factor, 
which is understandable given that the race is shorter than the Kentucky Derby, and 
thus evidence of a horse’s stamina exists. Even so, a positive return on dual qualifiers 
exists from 1981-2006. The procedure outlined shows that given the pools from a set of 
races for which the strategy is applicable, the simple model given in Equation (6) can 
be used to test the validity of the strategy. 
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APPENDIX A: Data Sources 


A.1. Public’s Wagering 


For races from 1946 to 1991, betting data were taken from tables published in The 
Courier-Journal, a Louisville, Kentucky, newspaper, usually on the Sunday after the 
Kentucky Derby. The pools for 1970 could not be found. There were several discrep- 
ancies in the data for which the published pools did not sum to the totals, or did not 
correspond with published win odds. Adjustments were made for errors for which 
an apparent revision could be made. For 1999-2002, complete pools were obtained 
from the Bloodstock Research Information Services Website (http://www.bris.com). In 
2001, the pools also appeared on the Website for the home of the Kentucky Derby, 
Churchill Downs (http://www.churchilldowns.com). The 2003 pools were sent directly 
by Churchill Downs, and the 2004 and 2005 pools were obtained courtesy of John Swe- 
tye who obtained them from Philadelphia Park’s Phonebet service. From 1992 to 1998, 
the pools recorded in The Courier-Journal did not have all of the bets included. While 
these totals are not available, win odds based on total wagering are available. There- 
fore, for 1970 and 1992-1998 we estimated the total win pool and backed out a set 
of win pool fractions that are consistent with the published win odds. There were no 
dual qualifiers in 1998 and 2003 so those years were excluded from the dual-qualifier 
modeling. 


A.2. Pedigrees 


Pedigree information was taken from The Blood-Horse magazine, the American Pro- 
duce Records, a software database called “The Pedigree Program,” the pedigree 
query Website, http://owl.netscout.com/pedigree (no longer active), the Del Mar Turf 
Club Website, http://www.dmtc.com/dmtc98/Pedigree/, thoroughbred registries, and 
Roman’s Website, http://www.chef-de-race.com. The 2004 data were sent by personal 
communication from Roman to John Swetye who forwarded them to us. 


A.3. Chef-de-Race Listings 


Classifications of chefs were taken from the original 1981 list (Roman, 2000), the 
American Racing Manual for each year from 1986 (the first year that the list was 
included) to 1993, and from Roman’s Website. For the period 1981~1986, the 1981 
list was used. For years prior to 1981, the original list was used. For 2001 to 2003, 
and 2005, Dosage Indices and EFH rankings tabulated by Roman were taken from his 
Website, http://www.chef-de-race.com, and for 2004 they were sent via email from 
Roman (see Section A.2, above). 
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A.4, Experimental Free Handicap Listings 


The EFH listings were taken from the American Racing Manual, The Blood-Horse 
magazine (print and online at http://www.bloodhorse.com), the Thoroughbred Times 
Website, http://www.thoroughbredtimes.com, and Roman’s Website, http://www.chef- 
de-race.com. 


A.5. Results of the Kentucky Derby and Major Races Prior to the 
Kentucky Derby 


The results of the Kentucky Derby were taken from the Daily Racing Form, both 
print and online (http://www.drf.com), press materials from Churchill Downs, and from 
Chew (1974). Recent results charts were obtained from the following Websites: 


About.com Inc. http://horseracing.about.com 
Sportsline.com Inc. http://www.sportsline.com 
CNN/Sports Illustrated —_http://www.sportsillustrated.cnn.com 
Equibase Company, LLC __http://www.equibase.com 

Daily Racing Form, LLC http://www.drf.com 


The results for the major races prior to the Derby were taken from the American 
Racing Manual and lists from the following Websites: 


Blue Grass Stakes: http://www.keeneland.com/liveracing/history.asp 

Flamingo Stakes: http://hialeahpark.com/99/HallofFame/flamingo.htm 

Florida Derby: http://www.thoroughbredchampions.com/library/fladerby.htm 
Santa Anita Derby http://www.revistahipodromo.com/santaanita.html 


Wood Memorial Stakes _http://www.nyra.com/aqueduct/index2.html 


and the out-of-date site, http://www.iglou.com/tbred/tc97/preps, which was run by the 
Thoroughbred Times. 


APPENDIX B: Kentucky Derby, Preakness, and 
Belmont Winners, 1946-2006 


TABLE B1 Kentucky Derby, 1946-2006 


Dual 
Year Winner Odds qualifiers 
1946 Assault* 8.2 5 
1947 Jet Pilot 5.4 4 
1948 Citation 0.4 4 
1949 Ponder 16 5 
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TABLE B1 (continued) 


Winner Odds 
Count Turf 14.6 
Hill Gail 1.1 
Dark Star 24.9 
Determine 43 
Swaps* 2.8 
Needles 1.6 
Iron Leige 8.4 
Tim Tam* 2.1 
Tomy Lee 3.7 
Venetian Way 6.3 
Carry Back 2.5 
Decidedly 8.7 
Chateaugay* 9.4 
Northern Dancer 3.4 
Lucky Debonair* 43 
Kauai King 24 
Proud Clarion 30.1 
Forward Pass* 2.2 
Majestic Prince* 14 
Dust Commander* 15.3 
Canonero II 8.7 
Riva Ridge 1.5 
Secretariat 1.5 
Cannonade 1.5 
Foolish Pleasure 1.9 
Bold Forbes 3 
Seattle Slew 0.5 
Affirmed 1.8 
Spectacular Bid 0.6 
Genuine Risk 13.3 
Pleasant Colony 3.5 
Gato del Sol 21.2 
Sunny’s Halo 2.5 
Swale 34 
Spend A Buck 4.1 
Ferdinand 17.7 
Alysheba 8.4 


Dual 
qualifiers 


— — N WK Fw LEN UN DWN WN De UT FN HW WN LY 


U P NY WwW WwW ww NW WH RA A 


(continued) 
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TABLE B1 (continued) 


Dual 
Year Winner Odds qualfiers 
1988 Winning Colors * 3.4 3 
989 Sunday Silence* 3.1 3 
990 Unbridled 10.8 4 
991 Strike The Gold* 48 4 
992 Lil E. Tee 16.8 3 
993 Sea Hero 12.9 3 
994 Go For Gin 9.1 5 
995 Thunder Gulch 24.5 6 
996 Grindstone 5.9 6 
997 Silver Charm 4 2 
998 Real Quiet 8.4 0 
999 Charismatic 31.3 6 
2000 Fusaichi Pegasus* 23 5 
2001 Monarchos* 10.5 3 
2002 War Emblem 20.5 3 
2003 Funny Cide 12.8 0 
2004 Smarty Jones 4.1 5 
2005 Giacomo 50.3 2 
2006 Barbaro* 6.1 2 


NOTE: Bold indicates dual qualifier, * indicates 
asterisk qualifier. 


TABLE B2 Preakness Stakes, 1946-2006 


Dual qualifiers Asterisk qualifiers 

Return Return Return Return 

Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
946  Assault* 1.4 2 —$2.00 3 -$0.60 
1947 Faultless 42 4 —$0.80 5 —$0.40 
948 Citation 01 3 ~$2.70 3 —$2.30 
1949 Capot 2.5 2 —$1.20 3 —$1.80 
1950 Hill Prince 0.7 2 —$1.50 2 -$2.10 
1951 Bold 4.1 1 —$2.50 2 -$4.10 
1952 Blue Man* 1.6 3 —$5.50 4 -$5.50 
953 Native Dancer 0.2 1 —$5.30 2 -56.30 
1954 Hasty Road 5 2 —$1.30 3 -$3.30 
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TABLE B2 (continued) 


Dual qualifiers 


Asterisk qualifiers 


Year Winner Odds Number 
1955 Nashua 0.3 2 
1956 Fabius 2.5 l 
1957 Bold Ruler 1.4 3 
1958 Tim Tam* 1.1 2 
1959 Royal Orbit 6.6 4 
1960 Bally Ache 17 3 
1961 Carry Back 1 2 
1962 Greek Money 10.9 2 
1963 Candy Spots 1.5 2 
1964 Northern Dancer 2.1 4 
1965 Tom Rolfe 3.6 2 
1966 Kauai King 1 3 
1967 Damascus 1.8 3 
1968 Forward Pass* 1.1 0 
1969 Majestic Prince* 0.6 1 
1970 Personality* 4.5 3 
1971 Canonero II 3.4 3 
1972 Bee Bee Bee 18.7 2 
1973 Secretariat 0.3 1 
1974 Little Current 13.1 1 
1975 Master Derby 23.4 2 
1976  Elocutionist 10.1 4 
1977 Seattle Slew 0.4 1 
1978 = Affirmed 0.5 3 
1979 Spectacular Bid 0.1 3 
1980 Codex* 2.7 2 
1981 Pleasant Colony 15 1 
1982  Aloma’s Ruler 6.9 1 
1983 Deputed Testamony 14.5 0 
1984 Gate Dancer 4.8 1 
1985 Tank’s Prospect 47 0 
1986 Snow Chief 2.6 2 
1987 Alysheba 2 2 
1988 Risen Star 6.8 2 
1989 Sunday Silence* 2.1 2 
1990 Summer Squall 2.4 2 


Return 
46-06 


—$2.00 
—$3.00 
—$3.60 
$5.60 
~$2.00 
—$2.30 
—$2.30 
-$4.30 
—$3.80 
—$4,70 
—$2.10 
-$5.10 
—$8.10 
—$8.10 
-$9.10 
~$12.10 
—$15.10 
-$17.10 
-$16.80 
-$17.80 
—$19.80 
~$12.70 
—$12.30 
~$13.80 
-$15.70 
~$17.70 
~$16.20 
$17.20 
~$17.20 
—$18.20 
—$18.20 
—$20.20 
—$19.20 
~$21.20 
-$23.20 
—$21.80 


Return 
81-06 


Number 


Retum 
46-06 


Return 
81-06 


$1.50 

$0.50 

$0.50 
-$0.50 
-$0.50 
—$2.50 
~$1.50 
~$3.50 
~$5.50 
$4.10 


ww we A EN WN FF WN YW WwW FW WwW WwW WW U SK N 


HK ON = 


Ww WwW fF WwW FS 


—$4.00 
-$5.00 
—$5.60 
-$6.50 
-$3.90 
—$4.20 
-$5.20 
—$8.20 
—$8.70 
-$9.60 
—$8.00 
—$11.00 
—$14.00 
-$13.90 
—$15.30 
—$13.80 
-$17.80 
~$19.80 
~$21.50 
—-$23.50 
~$27.50 
-$20.40 
—$20.00 
—$21.50 
—$23.40 
—$22.70 
~$21.20 
—$23.20 
~$23.20 
—$24.20 
~$24.20 
~$28.20 
$28.20 
~$32.20 
-$32.10 
—$31.70 


(continued) 


$1.50 


-$0.50 
~$0.50 
-$1.50 
~$1.50 
-$5.50 
—$5.50 
—$9,50 
—$9,40 
~$9.00 
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TABLE B2 (continued) 


Dual qualifiers Asterisk qualifiers 
Retum Retum Return Return 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1991 Hansel 9.1 2 —$13.70 $4.00 2 —$23.60 -—$0.90 
1992 Pine Bluff 3.5 1 -$10.20 $7.50 1 —$20.10 $2.60 
1993 Prairie Bayou 2.2 1 -$11.20 $6.50 2 —$22.10 $0.60 
1994 Tabasco Cat 3.6 3 -$9.60 $8.10 3 —$20.50 $2.20 
1995 Timber Country 1.9 2 —$11.60 $6.10 3 -$23.50 -$0.80 
1996 Louis Quatorze 8.5 2 -$13.60 $4.10 3 —$26.50 -$3.80 
1997 Silver Charm 3.1 1 —$10.50 $7.20 3 -$25.40 -$2.70 
1998 Real Quiet 2.5 0 —$10.50 $7.20 1 -$26.40 -$3.70 
1999 Charismatic 8.4 2 -$12.50 $5.20 4 —$30.40 -$7.70 
2000 Red Bullet 6.2 2 —$14.50 $3.20 4 -$34.40 -$11.70 
2001 Point Given 2.3 3 —$14.20 $3.50 5 —$36.10 ~$13.40 
2002 War Emblem 2.8 0 -$14.20 $3.50 1 —$37.10 -$14.40 
2003 Funny Cide 1.9 0 —$14.20 $3.50 0 ~$37.10 -$14.40 
2004 Smarty Jones 0.7 1 ~$15.20 $2.50 1 —$38.10 -$15.40 
2005 Afleet Alex 3.3 3 —$13.90 $3.80 4 —$37.80 -$15.10 
2006 Bernardini 12.9 1 —$14.90 $2.80 2 —$39.80 -$17.10 
NOTE: Bold indicates dual qualifier, * indicates asterisk qualifier. 
TABLE B3 Belmont Stakes, 1946-2006 
Dual qualifiers Asterisk qualifiers 
~ Retum Retum Return Return” 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1946 Assault* 1.4 1 ~$1.00 2 $0.40 
1947 Phalanx 2.3 2 $0.30 2 $1.70 
1948 Citation 0.2 4 -$2.50 5 —$2.10 
1949 Capot 5.6 2 $2.10 4 $0.50 
1950 Middleground 2.7 3 $2.80 4 $0.20 
1951 Counterpoint 5.15 3 —$0.20 4 $2.35 
1952 One Count 12.8 2 -$2.20 3 —$0.65 
1953 Native Dancer 0.45 1 -$1.75 1 —$0.20 
1954 High Gun 3.45 2 -$3.75 3 -$3.20 
1955 Nashua 0.15 1 -$3.60 1 -$3.05 
1956 Needles 0.65 2 -$3.95 2 —$3.40 
1957 Gallant Man 0.95 1 —$4.95 1 —$4.40 
1958 Cavan 4.5 0 -$4.95 1 -$5.40 
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TABLE B3 (continued) 


339 


Dual qualifiers Asterisk qualifiers 

Return Return Return Return 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1959 Sword Dancer 1.65 3 —$5.30 4 —$6.75 
1960 Celtic Ash 8.4 2 —$7.30 2 —$8.75 
1961  Sherluck* 65.05 4 —$11.30 5 $52.30 
1962 Jaipur 2.85 1 —$8.45 2 $54.15 
1963 Chateaugay* 45 1 —$9.45 2 $57.65 
1964 Quadrangle 6.55 3 —$4.90 4 $61.20 
1965 Hail to All 2.65 1 -$5.90 1 $60.20 
1966 Amberoid SS 2 —$1.40 2 $64.70 
1967 Damascus 0.8 1 —$2.40 1 $63.70 
1968 Stage Door Johnny 44 1 —$3.40 2 $61.70 
1969 Arts and Letters 1.7 1 -$4.40 2 $59.70 
1970 High Echelon 4.5 2 -$0.90 2 $63.20 
1971 Pass Catcher 34.5 3 $31.60 3 $95.70 
1972 Riva Ridge 1.6 3 $31.20 3 $95.30 
1973 Secretariat 0.1 1 $31.30 3 $93.40 
1974 Little Current 1.5 1 $30.30 2 $91.40 
1975  Avatar* 13.2 2 $28.30 4 $101.60 
1976 Bold Forbes 0.9 1 $29.20 1 $102.50 
1977 Seattle Slew 0.4 1 $29.60 1 $102.90 
1978 Affirmed 0.6 3 $28.20 3 $101.50 
1979 Coastal 44 3 $25.20 3 $98.50 
1980 Temperence Hill 53.4 4 $21.20 5 $93.50 
1981 Summing 79 1 $20.20 —$1.00 l $92.50 —$1.00 
1982 Conquistador Cielo 4.1 1 $19.20 —$2.00 2 $90.50 —$3.00 
1983 Caveat 2.6 1 $21.80 $0.60 3 $91.10 —$2.40 
1984 Swale 1.5 1 $23.30 $2.10 1 $92.60 —$0.90 
1985 Crème Fraiche 2.5 1 $25.80 $4.60 1 $95.10 $1.60 
1986 Danzig Connection 8 2 $32.80 $11.60 2 $102.10 $8.60 
1987 Bet Twice 8 2 $39.80 $18.60 3 $108.10 $14.60 
1988 Risen Star 2.1 1 $38.80 $17.60 3 $105.10 $11.60 
1989 Easy Goer 1.6 3 $38.40 $17.20 4 $103.70 $10.20 
1990 Go And Go 7.5 2 $36.40 $15.20 3 $100.70 $7.20 
1991 Hansel 4.1 3 $38.50 $17.30 3 $102.80 $9.30 
1992 A.P. Indy 1.1 2 $38.60 $17.40 2 $102.90 $9.40 
1993 Colonial Affair 13.9 2 $36.60 $15.40 3 $99.90 $6.40 
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TABLE B3 (continued) 


Dual qualifiers Asterisk qualifiers 

Return Return Retum Return 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1994 Tabasco Cat 3.4 2 $39.00 $17.80 2 $102.30 $8.80 
1995 Thunder Gulch 1.5 1 $40.50 $19.30 1 $103.80 $10.30 
1996 Editor’s Note 5.8 3 $44.30 $23.10 4 $106.60 $13.10 
1997 Touch Gold 2.65 1 $43.30 $22.10 2 $104.60 $11.10 
1998 Victory Gallop 4.5 1 $42.30 $21.10 2 $102.60 $9.10 
1999 Lemon Drop Kid 29.75 3 $70.05 $48.85 4 $129.35 $35.85 
2000 Commendable 18.8 0 $70.05 $48.85 0 $129.35 $35.85 
2001 Point Given 1.35 3 $69.40 $48.20 4 $127.70 $34.20 
2002 = Sarava 70.25 0 $69.40 $48.20 0 $127.70 $34.20 
2003 Empire Maker* 2 0 $69.40 $48.20 l $129.70 $36.20 
2004 Birdstone 36 1 $105.40 $84.20 l $165.70 $72.20 
2005 Afleet Alex 1.15 2 $105.55 $84.35 2 $165.85 $72.35 
2006 Jazil 6.2 0 $105.55 $84.35 l $164.85 $71.35 


NOTE: Bold indicates dual qualifier, * indicates asterisk qualifier. Crème Fraiche was coupled with dual 
qualifier Stephan’s Odyssey. 
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ABSTRACT The Kentucky Derby features top three-year-old thoroughbred horses. Run at 1 2 miles, it is 
typically at least 1/8 mile longer than any of the horses has raced before. This extra distance, usually 
combined with a large field, makes the race a difficult test of stamina for horses this young. Bettors, because 
there is no direct evidence of whether a horse has the stamina to compete effectively at 1 2 miles, are also 
challenged. The informational content of one publicly available, pedigree-based measure of stamina, the 
Dosage Index, is used with simple performance measures to identify a semi-strong-form inefficiency, and 
to create a betting scheme based on the optimal capital growth model that merges these criteria with the 
public’s opinion. Statistically significant profits, net of transaction costs, could have been achieved during 
the period 1981 to 2005. 


Key WorDs: Semi-strong market efficiency, capital growth theory, speculative investments, sports betting 


1. Introduction 


The Kentucky Derby annually gathers many of the top three-year-old thoroughbred horses at 
Churchill Downs in Louisville Kentucky on the first Saturday in May. For the horses entered, 
the race is a new challenge since its distance of 1} miles is typically at least 1/8 mile longer 
than any of them have ever raced. The extra distance of the Kentucky Derby, usually combined 
with a large field that includes many top-flight contenders, presents a significant test of stamina 
for these young horses. For the Kentucky Derby, the uncertainty about each horse’s stamina 
increases the difficulty for the public of establishing accurate win odds. This potential source of 
semi-strong-form inefficiency in the Kentucky Derby win market is the focus of this paper. 

Roberts (1967) defined a market as being ‘weak-form’, ‘semi-strong-form’ or ‘strong-form’ 
efficient if it is not possible to devise a profitable investment scheme net of transaction costs 
based on prices (or, for the racetrack, publicly available odds), based on all publicly available 
information, or based on all information, respectively. For traditional financial markets, there is 
considerable evidence that points to weak-form and semi-strong-form efficiency, but little evidence 
for strong-form efficiency (see Fama, 1970, 1991 and Keim and Ziemba, 2000 for surveys). 
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Weak-form efficiency of the racetrack’s win market means that betting systems based solely 
on the public’s win odds, established through pari-mutuel betting, are not profitable. Evidence 
from many tracks over many years has pointed to weak-form efficiency (Ali, 1977; Asch et al., 
1982).! Weak-form efficiency of the win market is a consequence of four of its features. First, 
transaction costs are high, about 13-20%, depending on track location, so a bettor needs to be 
considerably more successful than the average bettor just to break even.? Second, while the 
challenge is substantial, the concept of the win bet is relatively simple. Thus, bettors have no 
confusion about their task. Third, many racetrack bettors approach their wagering very seriously 
and some are very sophisticated. Fourth, for this serious audience, there is usually an abundance 
of relevant information, including records of past performances and workouts for all the horses, 
breeding, earnings, jockey records, etc. 

For the Kentucky Derby, the first two of these criteria are satisfied for the win market. While 
the third criterion is met, the Kentucky Derby also receives much more interest in North America 
from casual fans than any other race. Because, typically, none of the Derby entrants has raced at 
1 l miles, it can be argued that the fourth criterion is not fully met. 

The main objective of this paper is to determine whether the informational content of one 
particular pedigree-based measure of stamina that is publicly available, namely the Dosage Index, 
in conjunction with simple performance measures, is captured in the pari-mutuel win odds and, 
if not, whether they can be used to develop a profitable betting scheme. 

The operation of the racetrack market is discussed in the following section. Section 3 describes 
the Dosage Index and performance measures, and their application to the Kentucky Derby. The 
data used in the analysis are discussed in Section 4. Section 5 develops a scheme for estimating 
each betting interest’s win probability based on the public’s odds, the Dosage Index and the 
performance measures. The betting model is described and analysed in Section 6, and conclusions 
are given in Section 7. 


2. The Racetrack as a Sequence of Markets 


Prior to a race, bettors engage in markets that establish prices for the various betting opportunities 
for that race. Betting closes immediately before the race begins, and payouts are calculated 
immediately following the race for single race bets. We focus on the market for win betting. 
Suppose there are N betting interests in a race. Let W; be the total amount bet to win on betting 
interest i = 1,..., N. The total win pool for the race is 


w= wW.. (1) 


The ‘track payback’ Q (generally 0.80 to 0.87) is the fraction of each dollar bet that is returned 
to the bettors. The commission or ‘track take’ is 1 — Q. If betting interest k wins the race, then 
win bets on betting interests i Æ k return zero, while each dollar bet on betting interest k returns 
approximately QW / W,. The actual profit per dollar is rounded down to the nearest nickel or dime 
(this is called ‘breakage’). Together the track take and breakage constitute the transaction costs.* 

Typically, each horse in a race runs as a separate betting interest. However, two or more horses in 
arace that have common ownership typically run as a single betting interest known as an ‘entry’. In 
addition, in a race where there would be more betting interests than a preset maximum, the horses 
with the least-impressive credentials are grouped as a single betting interest known as the ‘Field’. 
A bet to win on an entry or the Field pays off if any member of that betting interest wins the race. 
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Both have been common in the Kentucky Derby. However, the long-time regulations in Kentucky 
changed in 2001, so there were no entries and no Field in the Kentucky Derby from 2001 on. 


3. The Dosage Index and Performance Measures 


The fact that usually no Derby entrant has raced at li miles prior to the race has led to the 
search for relevant information from alternative sources, including the horse’s pedigree. One 
method of evaluating a thoroughbred’s pedigree, commonly known as Dosage Theory, has its 
roots in the work of French cavalry officer Lt.-Col. Jean-Joseph Vuillier, who studied the pedigrees 
of exceptional thoroughbreds of the later 18th to the early 20th century (Vuillier (Pseudonym 
Lottery), 1902, 1906, 1928). The concept of thoroughbred dosage evolved through Varola (1974, 
1980), whose patented classification of prominent stallions according to the type of offspring that 
they produced first appeared in a series of articles in The British Racehorse. 

Roman’s (1981) modifications of Varola’s work are known as Dosage Theory. His work was 
outlined in Leon Rasmussen’s Bloodlines column in the Daily Racing Form beginning before 
the 1981 Kentucky Derby. One product of Roman’s pedigree analysis is the Dosage Index (DI), 
which is based on the categorization of prominent stallions in terms of whether they consistently 
sire offspring with distance proficiencies that are incongruous with the dosage profiles of those 
offspring when that stallion is excluded. Classified stallions are called chefs-de-race (or simply 
chefs)>; see Ziemba and Hausch (1987), Roman’s Web site, http: //www.chef-de-race.com for the 
rationale behind the selection of recent chefs, and Roman (2002). 

There are five categories for chef classification in Roman’s system: Brilliant, Intermediate, 
Classic, Solid and Professional. The categorization is based on ‘where they (sires) must lie on 
the speed-stamina spectrum to bring the figures of their descendants back in line with those of 
horses in the general population exhibiting similar performance traits’ (Roman, 2001). A chef 
can be placed in one or two categories. Each time a chef appears in a four-generation pedigree, 
points are awarded in the appropriate category. Points are assigned on a scale of 16 for the first- 
generation sire, 8 for each second-generation sire, 4 for each third-generation sire, and 2 for each 
fourth-generation sire. Sires that are classified in two categories have their points split. After the 
fifteen sires have been assigned points, the total for each category is entered into the Dosage Index 
formula 

_ Brilliant + Intermediate + 1/2 Classic 
Solid + Professional + 1/2 Classic 


It is evident that horses with a high D/ have a pedigree that is weighted towards Brilliant and 
Intermediate chefs, that is, sires who tend to produce offspring with greater sprinting ability than 
their pedigrees would suggest if that sire was eliminated from the pedigree. Horses with a low 
DI are predicted to have stamina. Very seldom will a stakes-quality horse have no dosage points, 
though some have so few that the D/ is unreliable. The dosage profile for 1997 Kentucky Derby 
winner Silver Charm is shown in Table 1. 

After the initial classification of chefs in 1981, Roman found that no Kentucky Derby winner 
from 1940 to 1980 had a DI exceeding 4.0, despite about 1 in 7 entrants having a DJ that high 
(over the interval 1946—1980 considered here). 

The Dosage Index is not a direct measure of the quality of a horse. One quality measure is 
the ‘Experimental Free Handicap’ (EFH), an annual ranking of two-year-old thoroughbreds that 
raced in select races in the USA (see http://www.jockeyclub.com/experimental.asp). Conducted 
since 1933 by the Jockey Club, the EFH assigns the top runners a figurative weight on a scale 
that usually has the two-year-old champion weighted at 126 pounds.® Exceptional horses have 


(2) 


560 Calendar Anomalies and Arbitrage 


286 R. S. Bain et al. 
Table 1. Dosage Index calculation for 1997 Kentucky Derby winner Silver Charm 


Generation Sire Brilliant Intermediate Classic Solid Professional 


1 Silver Buck 
Buckpasser 8 
Poker 
3 Tom Fool 2 2 
Hail to Reason 4 
Round Table 4 
Wise Margin 
4 Menow 
War Admiral 2 
Turn-to 1 l 
Mahmoud l 1 
Princequillo 1 1 
Nasrullah 2 
Market Wise 
Faultless 


Total 3 > 17 5 0 


Dosage Index = (3 + 5 + 17/2)/(5 + 0 + 17/2) = 1.22. 


been weighted up to 132 pounds. Other top horses are assigned lower weights based on perceived 
ability until a cutoff is reached at about 100 pounds beyond which no more are classified. Usually 
there are 15 to 30 horses classified within 10 pounds of the top-weighted horse. Roman (1981) 
observed that starting in 1972 most Kentucky Derby winners were rated within 10 pounds of the 
top-weighted horse. This observation led to the designation ‘dual qualifier’ for any horse that was 
weighted within 10 pounds of the top-weighted horse on the EFH (indicating the quality of the 
horse) and had a D7 less than or equal to 4.0.7 

Professional handicapper James Quinn offered a second measure of quality to add late- 
developers to the list. He defined what we call an ‘asterisk qualifier’ to be any horse that: (1) won 
at least one of a selection of premier races prior to the Kentucky Derby; (2) had a DI less than or 
equal to 4.0; and (3) was not rated within 10 pounds on the EFH. A horse is a ‘dual-or-asterisk’ 
qualifier if it qualifies for one of these two categories. 

Our objective was neither to judge these measures nor to refine them. Instead, our objective 
was to study whether any predictive power there may be in these widely publicized measures 
was incorporated into the public’s pari-mutuel win odds. If not fully incorporated, then a further 
objective was to investigate whether these measures could be used to determine win probability 
estimates that are sufficiently superior to the public’s so that a profitable wagering scheme based 
on win betting could be developed, despite the significant transaction costs. 


4. Data Acquisition 


This section discusses the nature of the data, while the sources of the data are described in the 
Appendix. 

The public’s win betting pool and results were collected for the period 1946 to 2005. For 52 of 
these years, dollar amounts that the public wagered were found. As a result, for these other eight 
years we did not have the exact fractions that the public wagered on each horse. However, it was 
possible to back out fractions that were consistent with these odds. 
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The Experimental Free Handicap listing and pedigree information for each Derby participant 
were collected for each year from 1946 to 2005. The original list of chefs was published in 1981. 
For years prior to 1981, this list was used, which means that the classification of chefs for 1946 
to 1980 is not completely out of sample. However, the hypothetical betting begins in 1981, so all 
betting is based on lists of chefs that were out of sample. For the period 1981-1986, the 1981 list 
was used (see Appendix for explanation). After 1986, an updated list of chefs was used each year. 

The major races for asterisk-qualifier status, with their 2005 graded stakes classification and 
the years that they have been run over the interval 1946-2005, were the Blue Grass Stakes 
(G1) (1946-2005), Flamingo Stakes (currently not run) (1946-1989, 1992-2001), Florida Derby 
(G1) (1952-2005), Santa Anita Derby (G1) (1946-2005) and the Wood Memorial Stakes (G1) 
(1946-2005). The Flamingo Stakes declined in importance before being cancelled, but was 
included because historically it was an important prep race. 


5. Application of Breeding Information and Performance Measures to Refine Estimated 
Win Probabilities 


Two models were developed for estimating win probabilities that depended on whether a betting 
interest was a dual qualifier or a dual-or-asterisk qualifier. 

The 1995 Kentucky Derby is used in Table 2 to illustrate the required information. Most of the 
possibilities in terms of qualifying are presented in Table 2. Also evident is a complication with 
regard to accounting for pedigree with entries (and the Field): the horses in an entry may not have 
the same qualifier status. This difficulty was handled using the following scheme: 


1. If all members of an entry had the same qualifier status, then the entry was considered as one 
horse with that qualification. 

2. If one member of the entry was a dual qualifier plus had won any of the designated major races 
prior to the Kentucky Derby, the entry was considered to be a dual qualifier regardless of the 
qualifications of the other member(s) (based on the presumption that in most cases most of the 
public’s attention on the entry was due to that horse). 

3. If the members of an entry did not all have the same qualifier status, but each was either a dual 
qualifier or an asterisk qualifier, then the entry was viewed as a dual-or-asterisk qualifier. 

4. In all other cases the entry was considered to be neutral, i.e. neither a qualifier nor not a 
qualifier. 


The qualifier status of the Field was determined in the same manner. For the dual-qualifier model 
there are 67, 0, 10 and 22 betting interests in the respective four categories, and for the dual-or- 
asterisk-qualifier model there are 57, 2, 10 and 30 betting interests in the respective categories. 

With respect to the dual-qualifier model, of the winners there are 29 that are qualifiers, 26 that 
are not qualifiers, and 3 that are part of a neutral entry. With respect to the dual-or-asterisk-qualifier 
model, of the winners there are 41 qualifiers, 16 that are not qualifiers, and 3 that are part of a 
neutral entry. In 1998 and 2003 there were no dual qualifiers so those years were ignored in the 
dual-qualifier modeling. 

We began our modeling with a base-case model that related a betting interest’s win probability 
to the public’s wagering to see if looking solely at the pools without the ‘expert information’ could 
lead to a profitable betting scheme. 

Let W/ be the public’s win bet on betting interest i, W/ be the win pool in race j, and N/ be 
the number of betting interests in race j. For race j, define p} to be the probability that betting 
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Table 2. Sample input data: 1995 Kentucky Derby field 


Qualifier status 


Horse W/W Entry EFH Won specified race DI Dual Asterisk 
Jambalaya Jazz 0.044 115 1.15 

Pyramid Peak 1 - Flamingo 3.00 ° 
Serena’s Song 0.189 122 2.11 ° 

Timber Country 2 126 3.29 e 

Mecke 0.066 Field 107 4.50 

Knockadoon Field - 3.57 

Citadeed Field - 1.60 

In Character Field - 1.77 

Ski Captain Field — 3.67 

Lake George Field = 4.50 

Thunder Gulch 0.033 116 Florida Derby 4.00 ° 

Tejano Run 0.087 121 2.38 e 

Jumron 0.126 115 3.80 

Eltish 0.070 123 3.00 e 

Afternoon Deelites 0.086 124 5.00 

Suave Prospect 0.059 113 4.60 

Talkin Man 0.167 114 Wood Memorial 3.00 . 
Dazzling Falls 0.029 111 6.20 

Wild Syn 0.042 — Blue Grass 4.33 

Wi/ W Post-time fraction of win pool. 

Entry Entry number or Field. 

EFH Experimental Free Handicap weight. Blank implies not weighted. (High weight 


for two-year-olds from 1994 was 126 pounds.) 
Won specified race Winner of a major race prior to Kentucky Derby. 
DI Dosage Index: see Equation 2. 
Qualifier status e implies meets qualifier requirements. 


interest 7 wins and define qj = wj / W! to be the fraction of the win pool bet on betting interest i. 
For this base case, the following model was used for each race 


dn Oa (3) 

Bae (qn)? 
If 6 is 1 then pi equals qi ; 

We used a standard maximum-likelihood approach to estimate the optimal values of 6. Consider 
R independent races and define K = (k,,..., kr) to be an R-tuple representing the winners of 
the R races, i.e. k; is the number of the betting interest that won race j. Let Pi, represent the 
estimated probability based on Equation 3, evaluated before race j, that betting interest k; wins 
race j. The probability that the vector K corresponds to the winners of the R races is 


R 
P(K|8) = | | pi,- (4) 
j=l 
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Transforming Equation 4 into a likelihood function that depends on 6 gives 


R 
LIK) x [| p}. (5) 


j=l 


A maximum-likelihood point estimate for 5, namely 5,4, can be found by maximizing the like- 
lihood as a function of 5. Our first value for dy, was calculated using the first 10 years of data, 
namely 1946-1955 inclusive. Thereafter, the value of ôm; was updated for each year using data 
from 1946 to that year. The win pool fraction for the winner and values for 5y, calculated after 
each year’s race are shown in Figure 1. 

The values for ôm; are less than 1.0 for the years prior to 1974. This is a consequence of the 
public’s more favored betting interests winning less often during this period than would have 
been expected based on the public’s odds. The winners from 1972 to 1979 were dominated by 
favorites, culminating with Spectacular Bid in 1979, so dy, increases over this interval, reaching 
a maximum value of 1.12. During the period 1980-2005, the public’s favorite seldom won, so 
ôm, tends to decrease to it final value 0.92.8 

With values of 5y;, close to 1.0, Equation 3 generates revised win probabilities that differ 
only slightly from the fraction of the win pool. The greatest ratio p/q; over the interval 
1981-2005 is 1.12. This 12% edge is insufficient to offer a positive expected return on a win bet 
after accounting for the transaction costs; hence this simple model points to weak-form efficiency 
of the win market over this period. 
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Figure 1. Value for ôm after each year’s race and the fraction of the win pool bet on the winner, 1955-2005 
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Our main objective was to use this same procedure to create models that modified the win 
probability for each betting interest based on whether or not it was considered a dual qualifier or 
considered a dual-or-asterisk qualifier. 

For this case, the probability of betting interest i winning is 


Fic ET 
wr Gh) 


The variable y equals œ if betting interest k was a dual qualifier (or dual-or-asterisk quali- 
fier depending on the test), 8 if it was classified as not a dual qualifier (or not a dual-or-asterisk 
qualifier if applicable), and 1 if the betting interest was an entry or Field classified as being neutral. 

Based on Equation 6, maximum-likelihood values for a and £, denoted as ay, and mz, were 
calculated each year. The initial estimate was to predict for 1956, using the first ten years of data 
(1946-1955). Figure 2 illustrates the progression of ay, and By, values for the dual-qualifier 
model, and Figure 3 shows ay, and By, values for the dual-or-asterisk-qualifier model. 

In these two figures, the critical pattern is the relative magnitude of œm, and mz. In Figure 2, 
æm, usually exceeds By, until after the 1980 race. Consequently, for this period, the revised 
win probability for each dual qualifier is usually less than or not significantly greater than the 
fraction of the money bet on it in the win pool (from 1956 to 1980 inclusive, only four ratios 


> Vis ¥m =, Bor. (6) 
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Figure 2. Values for «mz and By, for dual-qualifier model after each year’s race, 1955-2005 
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Values for Ay, and By 
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Figure 3. Value for œ mz and fmz, for dual-or-asterisk qualifier model after each year’s race, 1955-2005 


of revised win probability to win pool fraction are greater than 1.1 of which only one is greater 
than 1.15). This implies that betting on dual qualifiers, if they had been known, would not have 
been advantageous during that period. In the mid-1970s, dual qualifiers began to win consistently, 
eventually leading to By, exceeding œm, for the remainder of the study period. From 1983 on, 
the revised win probabilities for dual qualifiers exceed their fraction of the win pool. Figure 3 
for dual-or-asterisk qualifiers shows a similar pattern, although By, begins to exceed ay, after 
only three years. Thus, from 1959 on, the model predicts win probabilities for dual-or-asterisk 
qualifiers that exceed their fraction of the win pool over 91% of the time. For example, the original 
and revised estimates of win probabilities for 1995 are in Table 3. 

In Table 3 a betting interest’s estimated win probability rises if it meets the qualifier criterion. 
(This is not necessarily the case if there are many qualifiers because the sum of the probabilities 
is unity.) The effect of considering asterisk qualifiers is demonstrated by Talkin Man, an asterisk 
qualifier but not a dual qualifier. His estimated win probability varies from 0.114 to 0.204 
depending on the criterion used. 

The revised win probability estimates are occasionally sufficiently greater than the fraction of 
the win pool to allow a positive expected return even considering transaction costs. 

The percentage increase in the win probability over the fraction of the win pool for Thunder 
Gulch is much greater than for Entry 2. There is a general tendency for p;/q; to increase for 
qualifiers as q; decreases, which is a consequence of the power function model in Equation 6 
together with values of a < 6 anda < 1. 
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Table 3. Original and revised estimated win probabilities for the 1995 Kentucky Derby 


Betting Interest W/W DQ PDQ DAQ PDAQ 
Entry 1 0.044 -1 0.029 0 0.027 
Entry 2 0.189 1 0.242 1 0.220 
Field 0.066 -1 0.044 -1 0.035 
Thunder Gulch 0.033 1 0.075 1 0.076 
Tejano Run 0.087 1 0.144 1 0.137 
Jumron 0.126 -=l 0.085 -i1 0.068 
Eltish 0.070 1 0.125 1 0.121 
Afternoon Deelites 0.086 -1 0.057 -! 0.046 
Suave Prospect 0.059 -1 0.039 -1 0.031 
Talkin Man 0.167 -1 0.114 1 0.204 
Dazzling Falls 0.029 -1 0.019 -1 0.015 


Wild Syn 0.042 -1 0.027 -1 0.022 


W/W fraction of win pool bet on the betting interest. 


DQ indicator is 1 for dual qualifiers, O for neutral entries and —1 for non-qualifiers. 

PDQ revised estimated win probability based on dual-qualifier mode). 

DAQ indicator is | for dual-or-asterisk qualifiers, 0 for neutral entries and —1 for non-qualifiers, 
PDAQ revised estimated win probability based on dual-or-asterisk qualifier model. 


6. The Betting Model 


Betting amounts were determined using the optimal capital growth model (OCGM) which 
maximizes the expected logarithm of wealth on a race-by-race basis. This approach was developed 
by Kelly (1956) (and is commonly called the ‘Kelly criterion’), and was extended and rigorously 
proved by Breiman (1961). Among its properties are: (1) it maximizes the asymptotic growth rate 
of wealth; (2) it asymptotically minimizes the expected time to reach any specific sufficiently 
large wealth level; and (3) in the long run it outperforms any other essentially different betting 
strategy almost surely and provides infinitely more final wealth than any other essentially different 
strategy. (See MacLean et al., 1992, 2006; Rotando and Thorp, 1992; and Thorp, 2006 for further 
properties, and Ziemba and Hausch, 1986 for simulation results for shorter time horizons.) 

The revised probability of betting interest i winning based on dual-qualifier or dual-or-asterisk- 
qualifier status is p;. Let r; be the gross return per dollar bet based on the win odds established 
by the public. (As in Section 2, we are suppressing the superscript indicating the race number.) 
The OCGM requires solving the following optimization problem for each race 


m=) i=] 


N N N 
imi il 1- p ii t. f >0Yi=1,..., N d ;<1. 7 
miine DP af Ds + fr) s.t. fi i an Ds (7) 


The decision variable f; is the fraction of the current wealth to bet on betting interest i. Suppose 
that the betior’s initial wealth is w and betting interest i wins. Then w fir; is returned to the bettor 
after having invested w ie Fm, for a final wealth of w(1 — yei Ím + firi). The objective 
function determines the logarithm of final wealth for each betting interest winning, weighted by 
the probability of that betting interest winning. The actual initial wealth, w, can be disregarded 
in the formulation by having the decision variables be the fraction of wealth that is bet on each 
betting interest. The constraints comprise a budget constraint and non-negativity.° 

This formulation assumes that the bets are sufficiently small so that they do not infiuence the 
payout on any betting interest, i.e. the bets on betting interest i do not reduce r;. For the Kentucky 
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Derby, the win betting pool is so large that a typical bet is very unlikely to influence payouts.!° 
The large pools also permit the assumption that the percentage bet on each betting interest varies 
little in the final few minutes. 

Revised probabilities for the base-case model based on Equations 3 and 5 are sufficiently close 
to the public’s win probabilities that expected returns are negative on all betting interests over the 
period 1981-2005. Solving Equation 7 with these revised probabilities leads to no bets. 

For the models based on Equation 6 and on status as a dual or dual-or-asterisk qualifier, the 
betting started with an initial wealth of $2,500 in 1981. It was updated after each year based on 
the bets made and the actual outcome of the race. The wealth history for betting over 1981 to 
2005 is shown in Figure 4 for dual qualifiers and for dual-or-asterisk qualifiers. Overall results are 
summarized in Table 4. For the years up to the mid-1980s any advantage identified by the model 
is small and results in small bets. Comparing the values in Figure 4 with those in Figures 2 and 
3 shows that wy, and By, are relatively close over that interval. As the model predicts a greater 
advantage, the amount per bet, and consequently the volatility, grows. 

For comparison, betting $2,500 on the favorite to win from 1981 to 2005 would yield a loss of 
$41,500; betting $200 to win on each dual qualifier would yield a profit of $12,920 on $13,200 
bet; and a $200 bet to win on each dual-or-asterisk qualifier would yield a profit of $7,780 on 
$23,000 bet (neutral entries excluded as qualifiers). The improved return on investment compared 
to the OCGM in the short run is helped considerably by a few huge profits on qualifiers Gato Del 
Sol (1982), Ferdinand (1986), and Thunder Gulch (1995). 
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v  Dual-or-Asterisk Qualifiers 
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Figure 4. Wealth level history for Kelly win bets, 1981-2005 
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Table 4. Profits from both models based on Kelly bets and revised 
win probabilities 


Model based on qualifier type 


Dual Dual-or-asterisk 

Number of bets 61 107 
Total amount bet $32,828 $66,467 
Number of bet that won 9 14 
Initial wealth $2,500 $2,500 
Final wealth $5,514 $4,889 
Total profit $3,014 $2,389 
Percentage return on investment 9.2 3.6 


For both models the betting scheme produced profits over the 1980s and up to the mid-1990s, 


but the performance has been poor since. Several possibilities can be considered for this: 


. The sample space is too small. This could mean that the sequence of successes for qualifiers 


for both models from 1972 to 1997 was a short-term run, so that in the long run there is nothing 
to be gained from using either model developed here. 

The limited sample space also implies that the final wealth is sensitive to individual race 
results. To give an idea of the scope, two extreme examples are (i) if a non-dual-qualifier had 
won in 1995, instead of Thunder Gulch, the final wealth for the dual-qualifier model would be 
$2,221, i.e. a slight loss overall, and (ii) if 2005 winner Giacomo had a favorable change of a 
single dosage point in any category, he would have been a dual qualifier and the final wealth 
would have been $13,877 for the dual-qualifier model. 


. It is extremely difficult to make a proper assessment of all of the two-year-olds, so the EFH 


can omit suitable horses. A prominent example occurred in 2003 where eventual 2004 Derby 
winner Smarty Jones was not rated on the EFH. Yet, he had overwhelmed a field of state- 
bred two-year-olds in a stakes race at Philadelphia Park, but that race is not counted when 
determining the EFH. Roman (200Sa) lists other Derby winners such as Winning Colors and 
Sunday Silence who were superior two-year-olds but were not rated on the EFH. 


. Classification of chefs is an ongoing exercise. For example, Alydar was classified as a chef 


subsequent to Strike The Gold winning the Derby in 1991, so Strike The Gold, who won the 
Blue Grass Stakes, is not considered as an asterisk qualifier here (DI = 9.00), yet when Alydar 
was classified, Strike The Gold's DI was reduced to 2.60. Ziemba (1991) wrote a column about 
this prior to the 1991 Derby arguing that Alydar should be a classic chef as he had numerous 
classic distance winners. The classification of Alydar, and at what point, would possibly change 
other dosage indices. However, here we go with Roman’s classification so Strike the Gold is 
not an asterisk qualifier. 


. Roman (2005b) has pointed out the gradual rise in the DZ of Derby winners over time, so the 


failure of the system in the last few years of the study may reflect a shift of the overall breed 
in North America towards speed at shorter distances. Real Quiet (1998), Charismatic (1999) 
and Giacomo (2005) all had D/ values greater than 4.00. 


. The Flamingo Stakes decreased in significance in the final years that it was run. Including the 


Flamingo winner as an asterisk qualifier in recent years was unwarranted in retrospect. Two 
possible solutions are to drop the Flamingo at some point in the analysis, or switch to the 
Arkansas Derby as the fifth significant prep race. 
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Random betting generates expected losses in excess of 16%, due to the 16% track take plus 
breakage. Our results showed both qualifier designations approximately doubling wealth over the 
betting period. However, in light of the variation in wealth displayed in Figure 4, it is important to 
address the statistical significance of these profits. We do so with two approaches. The first treats a 
betting interest’s win or loss as a binomial random variable and then uses a Normal approximation. 
The second approach simulates the set of races assuming random wagering. 

Before considering the first approach, observe that the data in Figure 4 are not ideal for address- 
ing the statistical significance. Wealth generally grows until the mid-to-late 1990s, and then 
dramatically falls. This pattern of wins and losses leads to wealth that is highly variable. Focusing 
on just the dual-qualifier case, Figure 5 superimposes on Figure 4 the wealth level history assum- 
ing that the races were run in reverse order, i.e. we started in 2005 with $2,500, then updated 
wealth based on our results in 2005 and went to the 2004 race, and so on. Thus, the string of large 
losses occurs early with lower wealth, after which wins are more common. The final wealth is 
identical, since the optimal capital growth model simply determines the optimal fraction of wealth 
to bet each year. (For example, losing 10% one year and gaining 20% the other year leads to an 
overall return of 8%, whatever the order of the win and the loss.) Despite the final wealth being 
the same, the wealth histories are very different, as is the appearance of any statistical significance 
to the profits. 

Our goal is to assess the profitability of this system. To eliminate the effect of varying wealth, 
which can dampen or intensify the variance in profits, our test of statistical significance uses bets 


Wealth After Betting for Each Year ($) 


1980 1985 1990 1995 2000 2005 2010 


Figure 5. Betting wealth for Kelly win bets on dual qualifiers with races run forward (1981-2005) and run 
backwards (2005-1981) 
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and returns in each race assuming an identical initial wealth each race. We do not update wealth 
year by year (as in Figure 4); instead the initial wealth each year is assumed to be $2,500., 

Let q be the probability of winning a bet in each trial, n be the number of trials, c be the amount 
wagered each trial, and r be the gross return upon winning,!! and let X be the random variable 
representing the number of wins. The probability of profits exceeding a constant x is 


P[rX —nce>}. (8) 


Assume that the trials are independent. For bets in different races, this assumption is reasonable. 
For multiple bets on the same race — which are common -— wins are negatively correlated since 
if one betting interest wins then the others must lose. Negative correlation leads to a tighter 
distribution of wins, so in this way our analysis based on independent trials will underestimate the 
statistical significance of our results. Since X is binomially distributed, the Normal distribution 
approximates equation 8 as 


1-0[ ae (9) 


vnq(l — q) 


where ® is the cumulative distribution function of a standard N(0,1) variable. 

For dual qualifiers and assuming an initial wealth of $2,500 each year, there were 61 bets totaling 
$7,079, and of these bets, 9 won for a gross return of $11,357 and a profit of $4,278. Thus, 
c = 7,079/61 = 116.0 and r = 11, 357/9 = 1,262. If the system were no better than random 
betting, then q satisfies rq — c = —0.16c, recognizing the 16% track take, giving q = 0.07725. 
Then, by Equation 9, the probability of profits of at least the observed level of $4,278 given 
random betting is 2.0%. Suppose instead that the system is better than random betting but only 
good enough to offer zero expected profits. Then q solves rq — c = 0, or q = 0.09192, and, by 
Equation 9, the probability that such a system would produce at least the observed profits is 6.7%. 

For dual-and-asterisk qualifiers, with initial wealth of $2,500 each year, there were 107 bets 
totaling $13,268; 14 of these bets won for a gross return of $18,253 and a profit of $4,985. Thus, 
c = 124 andr = 1,304. If the system were no better than random betting, then q = 0.07988 and 
the probability of profits of at least the observed level is only 2.6%. Assuming instead that the 
system is better than random betting but only good enough to offer zero expected profits, then 
q = 0.09509 and the probability that such a system would produce at least the observed profits 
is 10.4%. 

Our second approach to addressing the statistical significance of the results involved two simu- 
lations for each qualifier designation. The first simulation dealt with the question of how likely it 
would be that profits at the observed level would have been generated if our approach is vacuous 
and, therefore, is essentially nothing beyond random wagering. The second simulation asked how 
likely it would be that the observed profits would have been generated if the system is able to 
improve upon random wagering, but only enough to achieve zero expected return on each wager 
(excluding breakage). The algorithm for the first simulation was 


1. Start with a betting wealth of $2,500 in 1981. 

2. Determine the fraction of wealth to wager on each betting interest i for the current year based 
on the Kelly criterion and the (wrong) assumption that our probability estimate, p;, is correct. 

. Randomly select the winner, with the probability of winning for betting interest i being q;. 

. Based on the simulated winner, its payout and our wagers, update wealth. 

. Repeat steps 2 to 4 for each year in order up to 2005. 


Un & Ww 
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Table 5. Results from 10,000 betting simulations with $2,500 initial wealth 


Dual qualifier Dual-or-asterisk qualifier 


Simulation 1 Simulation 2 Simulation 1 Simulation 2 


Final wealth < $1,000 (%) 54.3 37.0 73.8 49.3 
Final wealth < $2,500 (%) 84.2 72.6 91.5 76.1 
Final wealth > System’s 

Final wealth (Table 4) (%) 3.9 9.2 3.1 11.4 
Mean final wealth $1,555 $2,449 $1,030 $2,473 
Median final wealth $892 $1,371 $450 $1,022 


Maximum final wealth $67,601 $161,777 $97,336 $409,294 


The second simulation differed only in step 3, where the simulation used q;/ Q as the correct 
win probability for any betting interest i that received a wager in step 2. The expected return on 
every wager was zero, before accounting for breakage. The collective win probability of the other 
betting interests was such that probabilities summed to unity. For example, in a three-horse race 
with [q1, 42, 93] = [0.42, 0.21, 0.37] and Kelly bets having been placed on the first two horses 
based on [p;, p2, p3], the probability that the simulation would select each betting interest as the 
winner was [0.42, 0.21, 0.37] for Simulation 1, and was [0.5, 0.25, 0.25] for Simulation 2 after 
dividing the first two fractions by the track payback of 0.84. The simulations were run 10,000 
times each. The results are shown in Table 5. 

For all simulations, losses occurred more than 70% of the time. For Simulation 2 and for 
both types of qualifiers, the mean final wealth was close to $2,500, which is expected given the 
modification in step 3 of this simulation. For dual qualifiers, only 3.9% of the time did Simulation 
1 realize profits as high as our observed profit. For Simulation 2, the corresponding value is 
9.2%. For dual-or-asterisk qualifiers, these values for Simulations 1 and 2 are, respectively, 3.1% 
and 11.4%. 

As a final test the analyses were conducted using limited data. For example, if the year being 
considered was 1997 and the interval was 25 years, only information from 1972 to 1996 would 
have been applied. For the dual-qualifer model, using the entire data set produced the greatest 
final wealth; while for the dual-or-asterisk model, using an interval of 56 years produced a final 
wealth of $5,305. 


7. Conclusions 


The racetrack is a useful financial market for testing market efficiency and considerable evidence 
exists, including results in this paper, in support of the track’s win market being weak-form 
efficient, i.e. no profitable system can be developed based on the odds established by the public. 
This paper tests whether the win market is also semi-strong efficient, i.e. no profitable system can 
be developed based on the public’s odds and other publicized information. 

We focus on a particular aspect of the Kentucky Derby, which is that the Derby’s distance of 
14 miles is typically farther than any entrant has ever raced. This lack of direct evidence of an 
entrant’s stamina for this race has motivated the search for indirect evidence. Dosage Theory, which 
analyses a horse’s pedigree, has been offered as such evidence but it has also been controversial, 
both in general and in its relation to the Kentucky Derby. Other evidence that has been offered 
includes well-publicized rankings of horses and results from recent high-caliber races. 
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Our goal has not been to evaluate the criticisms or the justifications offered for the dosage 
concept and for the ranking of two-year-olds, nor do we attempt to refine their application to 
the Kentucky Derby. Instead, we developed a model that takes this information, which is readily 
available to the public and receives much attention, and merges it with the public’s win odds to 
establish win probabilities. We then tested these win probabilities within a betting system based 
on the optimal capital growth model and showed statistically significant profits. 

A specific application of the procedure would be for the 15-mile Belmont Stakes which is run 
weeks after the Kentucky Derby. This analysis is the planned focus of future work. It is known 
though from our preliminary analysis that during the period in the 1980s to mid-1990s when 
the dual qualifiers were having very good success in the Derby their results in the Belmont were 
not as good. However, in recent years the situation has reversed with much better results in the 
Belmont than the Derby. In a more general context, the betting systems detailed here are two of 
many ‘angles’ used by bettors. The procedure outlined shows that given the pools from a set of 
races for which the angle is applicable, the simple model given in Equation 6 can be used to test 
the efficiency of the market with respect to the angle. 
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Notes 


An exception may be extreme favorites at odds of 3-10 or less, which, while quite rare, have been shown to produce 
a small average profit (Ziemba and Hausch, 1986). Inefficiencies in other more complex markets are more common; 
see Hausch et al. (1994) for such evidence. 

Large bettors can reduce this take by betting at rebate sites that return a portion of the bet to make the actual take 
about 10%. We do not deal with such bettors here nor with those outside the USA who wager on Betfair or other 
betting exchanges against other bettors directly rather than in a pari-mutuel pool as discussed here. 

Our notation deals with one race only. In Section 5, to deal with several races simultaneously, we will add a superscript 
to our notation to identify the race number. 

For each track there is a minimum payout, usually 5%, that the track must return even if there are insufficient funds 
available in QW. 

Mares are not included because they are considered to have too few offspring to identify distance proficiencies, while 
it is not unusual for a stallion to sire over 100 offspring in a year. 

Ranking horses by weight is a familiar concept at the racetrack. In handicap races, the top horses carry greater weight 
(jockey + saddle + additional weights if necessary) than the less-qualified horses. Handicapping of this sort occurs 
only in select races and is intended to make the race more competitive. 

Some people expand the dual qualifier category to include any horse that is declared a champion in a country other 
than the USA and has a D/ less than or equal to 4.0. In this paper, only the definition as given. 

Griffith (1949), McGlothlin (1956), Ali (1977) Asch et al. (1982) and Ziemba and Hausch (1986), among others, have 
demonstrated that the public’s wagering has a strong and stable bias of underbetting the favorites and overbetting the 
longshots. This results in ô > 1.0. Ziemba and Hausch (1987) provided evidence that this ‘favorite-longshot bias’ is 
exhibited at the Kentucky Derby but it is weaker, that is more flat, than in these earlier studies. The recent advent of 
rebate and betting exchange wagering has led to a flattening of the favorite-longshot bias in recent data since about 
1998 (Ziemba, 2004). 

The solution for Equation 7 was obtained using the Fortran package DONLP2 written by Peter Spellucci which is 
available from NetLib (http: //www.netlib.org). 

See Hausch et al. (1981) for a formulation that does account for the bettor’s effect on payouts. 
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1 In practice, q, c, and r vary across races and even within races if there are multiple wagers. We approximate the 
sequence of wagers by using the average values of these parameters. 
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Appendix: Data Sources 
(1) Public’s Wagering 


For races from 1946 to 1991, betting data were taken from tables published in The Courier-Journal, 
a Louisville Kentucky newspaper, usually on the Sunday after the Kentucky Derby. The pools for 
1970 could not be found. There were several discrepancies in the data for which the published pools 
did not sum to the totals, or did not correspond with published win odds. Adjustments were made 
for errors for which an apparent revision could be made. For 1999-2002, complete pools were 
obtained from the Bloodstock Research Information Services Web site http://www.bris.com. In 
2001, the pools also appeared on the Web site for the home of the Kentucky Derby, Churchill 
Downs, http: //www.churchilldowns.com. The 2003 pools were sent directly by Churchill Downs, 
and the 2004 and 2005 pools were obtained courtesy of John Swetye who obtained them from 
Philadelphia Park’s (Bensalem Pennsylvania) Phonebet service. From 1992 to 1998, the pools 
recorded in The Courier-Journal did not have all of the bets included. While these totals are not 
available, win odds based on total wagering are available. Therefore, for 1970 and 1992-1998 we 
estimated the total win pool and backed out a set of win pool fractions that are consistent with the 
published win odds. There were no dual qualifiers in 1998 and 2003 so those years were excluded 
from the dual-qualifier modeling. 


(2) Pedigrees 


Pedigree information was taken from The Blood-Horse magazine, the American Produce 
Records, a software database called The Pedigree Program, the pedigree query Web site 
http://owl.netscout.com/pedigree (no longer active), the Del Mar Turf Club Web site 
http: //www.dmtc.com/dmtc98/Pedigree/, thoroughbred registries, and Roman’s Web site 
http: //www.chef-de-race.com. The 2004 data were sent by personal communication from Roman 
to John Swetye who forwarded them to us. 


(3) Chef-de-Race Listings 


Classifications of chefs were taken from the original 1981 list (Roman, 2000), the American 
Racing Manual for each year from 1986 (the first year that the list was included) to 1998, and 
from Roman’s Web site. For the period 1981—1986, the 1981 list was used. For years prior to 
1981, the original list was used. For 2000 to 2003, and 2005, Dosage Indices and EFH rankings 
tabulated by Roman were taken from his Web site, http://www.chef-de-race.com, and for 2004 
they were sent via e-mail from Roman (see (2)). 
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(4) Experimental Free Handicap Listings 


The EFH listings were taken from the American Racing Manual, The Blood-Horse magazine 
(print and on line http://www.bloodhorse.com), the Thoroughbred Times Web site http:// 
www.thoroughbredtimes.com, and Roman’s Web site http://www.chef-de-race.com. 


(5) Historical Results of the Kentucky Derby and Major Races Prior to the Kentucky Derby 


The results of the Kentucky Derby were taken from the Daily Racing Form, both print and on line 
(http://www.drf.com), press materials from Churchill Downs, and from Chew (1974). Recent 
results charts were obtained from the following Web sites: 


About.com Inc. http: //horseracing.about.com 
Sportsline.com Inc. http: //www.sportsline.com 
CNN/Sports Illustrated http: //sportsillustrated.cnn.com 
Equibase Company, LLC __http://www.equibase.com 
Daily Racing Form, LLC _http://www.drf.com 


The results for the major races prior to the Derby were taken from the American Racing Manual 
and lists from the following Web sites: 


Blue Grass Stakes http: //www.keeneland.com/liveracing/history.asp 

Flamingo Stakes http://hialeahpark.com/99/HallofFame/flamingo.htm 

Florida Derby http://www.thoroughbredchampions.com/library /fladerby.htm 
Santa Anita Derby http: //www.revistahipodromo.com/santaanita.html 


Wood Memorial Stakes —_http://www.nyra.com/aqueduct/index2.html 


and from the Thoroughbred Times current web site http://www.thoroughbredtimes.com and 
their out-of-date site http://www.iglou.com/tbred/tc97/preps. The sites for Hialeah Park and 
the Venezuelan site revistahipodromo.com are out of date now. 
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